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ERIC 



PREFACE 



The Finnish-Engl lih Contrastivi Project was Started the University 
Of JyvMskyla in the spring of 1974. The project has been carried out l:n 
two partii the Departnient of Phonetics and Generil Linguistics is respan- 
sible for the work on DrQnunciatian problems, and the Department of 
English is concerned with other aspects of the reiearch. Several reports 
have Deen publi'shed previously on phonetically orientated topics (eg. 
Mgisio and Valento 1975^ Suomi 1«76^ Lehtonen and Koponen 1977-, Sajavaara 
and Lehtonen ig78a; Lehtonen 1978} and the results have also been aDplied 
to language teaching purposes in a textbook of spoken English (Lehtonen^ 
Sajavaara and Hay 1977). The methodology and problems of contrastive 
phonecfcs have been discussed by eg, Lehtonen (1977; 197Q). In co- 
operation with the Error Analysis Troject at Abo Akademi , the Contrastive 
Project has also carried out some tests; the results have been suniiiaf ized 
m Palmberg (1979), In addition, the problems concerning the Influence 
of English language and culture on priSent-^day Finnish have been studied 
in co-operation with the Anglicism Project at the University of Jyvaskyla 
(see eg. Sajavaara et al. 1978). 

In traditional contrastive analysis — an analysis of the similarities 
and differences of two sound systems it was supposed that the difficult- 
ies of foreign language students could be described and predicted. The 
'strong hypothesis' of contrastive analysis has been much criticized (see 
eg. Sajavaara 1977). It obviously failed to recognise the Importance 
of actual problems in learning situations which 1s exactly the kind of 
knowledge that the FL teacher needs in his classrooni practice > Moreover i 
the focus was on foreign language production; the role of the listener, 
which is equally important, was grossly underestimated. The listener's 
Impressions of speech derive from Internalized categories of rules and 
structures usually called phonology. Thus, he does not hear the actual 
physical utterance but a chain of segments corresponding to the expec- 
tations about the utterance. This Is typical of human perceptual mechan- 
isms 1n generals we observe structures of whose existence Ha have 
discovered and which we choose to observe. For a speaker-heai^erj language 
is in earphone through which he can hear only language, not speech* Thus, 
many of the acoustic cues which actually carry the information necessary 
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for speech perceptlQ.n are subconscious (s'3. ^ formant patterns of vowels, 
consonantal loci and transitionSi temporal patterns, of ytterancej erc.)- 
Although a spaakar can pay cengcioys attentioF^ to certain pDStures and 
movements of his speech, organs, both the way i\t\ which thoy actually 
operate during speech and thfi' resulting acoustic message are beyond 
the scope of conscic^us observaEiDrK 

All this has important coniequences: the contrastlye analyst and the 
foreign language teacher must rtalize that the way 1n which they 'hear' 
a certain pattern may be totally different to the way in which the 
student perceives the same thing. The probtenis of the foreign lar/gudge 
student can be under? nly if we know How he 'feets,', what he att^tmpts 

to hearj what he ac^ eftrSs whrit the structures are that actually 

perceives, and how jiffsr from the target* ie„ the way the native 

speaker's perception works in similar situations, 

Therefo*"e, the focus of studies should not be on 'pfonuneiation' 
only but on a complex of problems which extend far beyond the area of 
what is traditionally called pronunciation teaching. This conslix 
includes the differences in the learner's LI and 12 communication chains 
as a wholes as well as the problems and cofifuslons arising from the 
fact that the learner unconsciously resorts to LI phenomena not only when 
producing L2 but also when trying to understand L2. 

The majority of the contrastive phonetic studies carried out within 
the Project were, however, limited to specific phonetic phenomena: the 
fortis/lenis distinction * vowel reduction, word and sentence stress * 
consonant clusters * word boundarieSi fluency * pauses, the acceptability 
of the Finnish variety of pfOnunciation * the feliao^i -ity of the marking 
of pronunciation errors etc, (for closer discussion, see eg* Sajavaara 
and Lehtonen 1978b), The present volume i? a final report on the strictly 
phonetic part of the project. %m>B of the topics included in the 'Orilgiinal 
research programme cannot he reported her^s either because they have been 
delayed or the methods a'ppli^d have proved to be inadequate and the results 
are therefore unreliable ^ That is why studies on such important topiics as 
word and sentence stress or rhythm and intonation are not found in this* 
volume. In addition, some of t'he later studies have a wider scope: they 
deal with various aspects of discourse and not with specific Bronunciatlon 
problems. The reports of these studies, which also include non-verbal 
communication analysis, are not included here, sifice a separate issue 
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icatfd to discourse: analysis wir hn publ ishid in this series. 
Two of th^f present papers are concerned with the fort is/ 1 en is 
Stinction, The paper by Suomi is a version of a paper read at the 
.^:h Meeting of Finnish Phoneticians at Jyvaskyla in 1978. It is a 
survey of the recent work on the phonetic parameters of the fortis/ 
lenis distinction and a discussion of the theoretical problems involved. 
In his dUcussion of the problems of labelling the distinction the 
author prefers the terms "voiced-voiceless" to "fortis^leni s% which 
were recofT^ended for Dedaaoaical purDoses by eq. Lehtonen n971/ and 
Gmson {]%t). In a related study, Hanninen has analysed the learr^ing 
difficuUtes encountered by Finns in three different contextual environments 
in isolated i^'ords, in words includid in word lists and in words embedded 
in meaningful sentences. The duratloral pattern of words embedded in 
meaningful sentences appeared to be clearly different from the other 
environment. Acoustical data on the fricatives and affricites also 
turned out to be different from that of the D^osives. The learning 
difficulties of Finns vary a great deal according to the context. In 
identification tests, Finnish students failed to identify niost of the 
tested oppositions. English listeners had considerable difficulty in 
Identifying the opposition as produced by Finnish speakers. Reduction of 
the final vowel in English VCV-structures 1s discussed by Heikkinen. 
As regards the learning p.^oblems of Finnish students, she discevered 
that the duration used by Finns was either influenced by the mother 
tongue ( le. it varied accordiug to the cruantity of the first syllable) 
or It was generally too short. The quality of /a/ used by Finnish speakers 
was usually influenced by lip rounding, which reflects the influence of 
Finnish /o/. Heikkinen also includes a short discussion of co-articulation 
and some tentative results concerning the co-articulatiDn between the 
Initial and final vowels, Lanminrngki deals with certain problems of 
language testing as related to Identification problems. The identification 
cues used by Finns turned out to be dramatically differe/it from those of 
native speakers. Lomrtiinmaki also considers the adequacy of the tests 
available In interlanguage research. Theif effectiyeness in revealing 
the "critical" points in the interlanguage channel is rather limited. 

The inadequacy of present tests of TL skills is partly due to the 
fact that we lack an inteflrated picture of the mechaniims functlnning 
in native^language conwunicatiDn . Indirectly, however, foreign-language 



speech perceptfon my be approached through ar analysis of the learner's 
production. If the features of production that are important in syntactic 
and senantic processing deviate from those of a natfvp speaker, it can 
be predicted that the learner also has difficulties In his perception. 
Similarly, if native-like features are found in the production of the 
learner. It can be expected that he is more or less capable of using the 
same cues as a native speaker in his perception. However, quite unexpected 
elements may be confused with each other. For example, a Finn may fail 
to perceive or produce the fortis/1en1s distinction as a result of the 
long/short distinction of Finnish vowels. Differences in signalling 
higher-level information, such as morphological and syntactic pattims, 
lexical units, or textual features can also result In interference in 
speech connynicatlon. For instance, the process of lexical word detection 
in English may be somewhat different from that of Finnish, due to the 
typological differences {isolating vs, agglutinating) between the two 
languages (see eg, Karlsson 1976^ Lehtonen and Koponen 1977; Ringhom 1979), 

The crucial question In contrastive fi3ld-WQrk is what is ft that we 
want to test? Do we want to test the student's knowledge of linguistic 
structures, or his capacity to cofmunlcate In a foreign language? The 
static view of language emphasizes the Importance of linguistic structures 
as such, whereas the dynamic view of language stressrs the communicative 
situation Itself.' what goes on in the mind and environment of the speaker- 
hearer. The phoneticians' attention has traditionally fae^n focused on the 
linguistic code. The assumption, however, that linguistic categories 
as such could be transferred to the analysis of speech coniTiunI cation is not 
lee1t1nate,Phonimes, morphemes, or junctures, for example, are not necess- 
arily relevant units in either speech processing or language acqyisltion 
(Sajavaara and Lehtonen 1979, Lehtonen and Hurme 1979), As a matter of fact. 
It is far from self-evident that these units have any connection with the 
processing of information in communicative tasks. It was therefore rather 
unfortunate that Items of a structuralist phonolDgical analysis were also 
regarded as basic elements in language teachings phonetic differences, for 
example, were divided into phonological and allophonic ones.- Phonological 
errors were dangerous, while allophonic ones were unimportant. Distinctive 
prosodic features were taught with special care, eg. English stress oppositions 

The narrowness of the traditional phonetic point of view has been 
emphasized by eg, Ladefoged (1977:410). 
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likg 'import/in'Dort, or distinctive intonation patterns likt Tune 1 and 
Tune 2, v/hereas non-distinctive features v/ere nivtn littlt attentinn in 
teaching materfali and textbooks. In short, teaching baied on struc- 
turaltst view of language regardid the various levels of grammar as If 
they exiited In isolation. In pronunciation teaching, it was not seen 
as necessary to Day any attention to levels other than that of phono! = 

ogy= 

The dynamic , or cofTinunicotive view of language emphasizes the 
communicative behaviour of the language learner. It is acknowledged 
that the success or failure of a communicative act does not depend 
solely on the linguistic structures internalized by the speaker: we 
have to consider various psycho- and socio! inguistic aspects In addition 
to all levels of grammar* One aim of future studies on contrastive 
phonetics will be to throw light on the speech behaviour of the foreign 
language student: a more proper terni for such analysis may be cross- 
language conmuni cation analysis. After all, more interesting than 
knowing what linguistic structures the speaker uses in his interlan- 
guage grammar is knowing why, how, and when he uses them. 



J.L, S K.S. 
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VOWEL REDUCTION IN THE ENGLISH OF FINNISH LEARNERS 



Hannele Hiikkinen 



INTRODUCTION 

The aim of the preient study is to discuss some thearetical aspects 
of English vowel reduction and their possible implications for Finnish 
learners of English. The question of vowel reduction is both linguisti- 
cally and phoneticaHy complex* Its status in a grammar is vague. It is 
particularly difficult to draw a Ifne between competence and pGrformance 
where vowel reduction with all ith phonetic parameters is concerned, as 
Chomsky and Halle {1568:110) note. There have been various interpretations 
of the graninatical status of the reduced vowels in English (see eg. Chomsky, 
Halle and Lukoff 1956) according to which /a/ is the only reduced vowel in 
English, According to an alternative interpretation English vowels are 
divided into two systems: l full vowel system and a reduced vowel system. 
The concept of reduced vowel system was introduced by Trubetzkoy (1969). 
This kind of solution has been suggested by eg. Hultzin (1961), Lehiste 
(1970)* Nyquist Goes (1574) and Wiik (1965). 

In the present study the tenn vowel reduction is used to Imply the 
fact that in unstressed positions the number of English vowels is reduced 
to two: /a/ and /i/. The study is mainly concerned with segmental factors 
and such factors as speech rate or rhythm are thus excluded. These factors 
would necessarily demand a different kind of analysis, probably requiring 
consideration of syntactic and semantic factors. Jakobson and Halle have 
noted (1968:413-414) that the sound structure Of speech may be astonish- 
ingly eiliptict depending on the situation, speech rate etc. but if the 
necessity arises i the speaker can imned lately translate his utterance 
into the explicit form, which the listener can readily understand. The 
framework of the present study is the explicit fonn cited fay Jakobson and 
Halles rather than the form used when "a variety of other factors (speedy 
casuamess, frequency of the itm^ predictability etc.)'' (Chomsky and 
Halle 1968:111) is involved. The phonetic analysis occurs thus on the 
level of word rather than on the level of sentence or discourse. 
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Phonetic di,cu,s1ons af vowel reduction ar, as various as phonological 

vow.i T'l " °' determining 

vowel reduction. Thus for e.a.ple Lindblo, (1963) argues that reduction 
s caused y the fact that neural co^ands to the articulatory mechanism 
follow each other too rapidly and th, mechanism is not able to r»ach the 
vowel target thus resulting in "undershoot". Lindblon, thus uses the term 

Another phonetic parameter of vowel reduction is centralization. It is 

cent f-'T'^ "'""^ ™- ^ 

c ntralized than stressed ones. This implies that the for^.nt values 

L'^Jo T 7V]:'' ' 500 H,. ,500 H. 

t L s V"'™*»^ °^ -^-ii. -ually mentioned 

textbooks, .s greater contextual assimilatiif or coarticul.tion with 
nvironment Thus for example Ohde and Sharf (1„5, use the ter,^ 
e ct.on to i.p,y the influence of the consonant upon the vowel. Vocalic 
nlue c. and influence of the position have also been discussed in phonetic 
mer. ure These three factors will be considered in this study and 
-.d ction^. as a phonetic ten., is used here to refer to the chLges 

::u:;rv:w:ir"^^ - - 

BACKGROUND OF THE STUDY 

The methodological approach of this study is based on contrastlve 

0 t,„. There has been general suspicion of the results of contra tive 
s ud es (see eg. Sajavaara ig77da). si..., according to the critics 

t Mching cruic.sm has also been made of the static point of view e=c 
,d in contrastlve studies, co^unic.tive competence and pr.g^t 
lems have been badly neglected (for a closer discussion, see eg 

^ the ^r"": f ''''' - t° 

r t he .pp,,„h because of various obvious shortcomings in its early 

1 «t,ons Thus, for example, the contrastlve approach does seem^L 
r considering the phonetic difficulties of foreign language le.r s 
T e Phonetic process of speech production is. after all. the on y eve 

L ^ T^' e.Jlv.len \ n 

the sound segments in languages, as Lehtonen (1977..31) remarks- 
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Foreign accint can be diflnid as acoustical deviations from the norm 
of the native speaker {Jonasson and McAniiter 1972:11). The factors 
contributing foreign accent can thus be measured. 

It has to be noted * however, that pronunciation errors are not alway! 
regular, or predictable. But some predictions can be made. Errors can bf 
causid by the influence of the mother tongue , or they may be due to the 
incorrect interpretation of the rules of the foreign language. Generally^ 
phonetic errors a^-tributable to the influence of the mother tongue may be 
said to reflect a rather low level of conipetence in the foreign language 
(eft Lehtonen 1977i32), Certain errors, howeveri seem to "fossilize" 
(cf. Sel inker 1972) and remain in the prDnunciation, being typical for 
speakers of a specific language. On the other hind, a speaker may himself 
construct an accent component "which has no reference in either of the 
languages he has been exposed to' (Jonasson and McAllister 1972:23). 
These incorrect pronunciation rules are typical for more advanced students 
such as the infonriants of the present study. This accent component can, 
naturally, change, and the student can eliminate certain rules and form 
new ones. But It seems that a foreign accent cannot be totally eliminated 
if the learner is an adult (cf, Lenneberg 19671181) although it can be 
greatly improved. It has to be noted, too, that all errors are not equally 
serious from the communicative point of view, and some accents can be 
■ tolerated better than others. These, and other related problems should be 
given serious attention in teaching (for a closer discussion of pronunci- 
ation errors, see Sajavaara, Lehtonen and May 1977:9-22). One problem, 
which had to be neglected in the present study, is the role of the hearer 
In the communicative situation, ie. the perceptual aspects of vowel re- 
duction. According to Lehtonen ( 1977i36), negative transfer in language 
learning is to a great extent due to different cue patterns in mother 
tongue and target language. This implies that the incorrect pronunciation 
is often only a reflection of "incorrect" hearing. Although the various 
acoustic cues involved in the perception pf a reduced vowel have been 
analyzed, the relative importance of them for the hearer has not yet been 
tested. 
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THE PROCEDURE OF THE STUDY 

Disynabic Inglish words with the itructure CVCV were selected as 
test material. The final unstressed syllable was thus available for study. 
The initial consonant (CI), was, when possible, a labial, while the Inter- 
vocalic consonant (C2) was alveolar. The consonants were plosives, if 
possible. The final vowel {V2) was /a/ or /I/. All the Engl ish vowels , 
and the diphthongs as well, occurred in the position of VI, the initial 
vowel. The principal aim was to detemilne the vocalic influence of the 
stressed vowel upon the unstressed one. The durational factors, mainly 
the influence of the preceding syllable upon the final vowel, were also 
considered. 

The informants (N^li) read the test words (NM5) embedded In a sen- 
tence frame and their productions were recorded on tape in the Phonetics 
Laboratory at the University of JyvaikylM. The tape recorder was Revox A77 
and the microphone AKG The infofmants were grouped according to 

their level of competence in English. There were 5 native speakers of 
English (E), 5 students of English who had acquired a cm teirfc ^level 
In their English studies (SI), and 5 students of other subjects (S2), 
The language skills of the SI group can be assumed to be typical of 
English language teachers in Finland, whereas the S2 group represents 
fairly well the language skills that are received in secondary school. 
All Informants were male because of the spectrographic analysis. 

Spectrograms were made from all test words using the Kay Elemetrics 
Sona-Graph 60618 of the Phonetics Laboratory. The nominal bandwidth was 
ISO Hz, resulting In a practical bandwidth of 300 Hz when the double 
recording speed was used. The upper limit of the frequency scale was 
4000 Hz. The duration of VI, V2, and C2 was measured from the spectro- 
grans with a 5 msec accuracy. The duration was determined according to 
the beginning and the end of the second formant. The formant frequencies 
of the first three formants were measured for both VI and V2 with an 
accuracy of 50 Hz (see Appendix), 

The statistical analysis required was carried out at the Computer 
C^^ntre of the University of JyvaskylM. A t-test was used to indicate 
the differences in means between the informant groups. The results include 
means (X) of various phonetic parameters, their standard deviation (SD); 
and the statistical significanwi of the differences In means between three 
Infonnant groups. Significance is indicated as follows^ + ^almost sign,, 
P 5% ; ++ ^significant, p 22 ; +++ .very sign., p ]% ; and ++++ -strongly 
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Sign., p Q.B%\ ngn ^ no statistical significance. {Jm differences of 
means have been compared for the most part between irnformant groups * 
but occasionally also between word groups etc.). Some additional calcu- 
lations were made using the Compucorp calculator of the Phonetics Depart- 



ment. 



PHONOLOGICAL ASPECTS OF VOWEL REDUCTION 

It Is a fairly common tendency among ths world's languaps that there 
are fewer vowels in unstressed positions. According to Trubetzkoy (19691 
236) this phenomenon Is called rfductlve neutraliiatfon (a subprocesi to 
vowel neutralization; cf. Done^an mu^r 1972), an<t 1t means that the 
phonological oppositions are nuutralized in L;np™?nent positions. 

Reductive neutralization -is a prdcess tnmm la several languages* 
but the specific rules vary -> n% Trubetzkyy states {lS69i22S) - from 
language to language, and ff'om dialect to dialect. To name but a few 
examples: in Swedish (Sigurd 1965:142), the number of vowels Is reduced 
to two In certain positions; in Italian (Tagllavini k Hioni 1971^64) 
the oppositions /e/ - /e/ and /o/ - /□/ are neutralised in unstressed 
position; in Modern Greek (Haiman 1972:368) the process Is similar to 
that of Italian! and in Russian (Trubetzkoy 1?S9:236) enly three vowels 
are capable of occurring in unstressed position. It can also be suggested 
that there 1s_a reduced vowel system in English consisting of the vowels 
/#/ and /t/, Af/ has also been proposed as part of the system, but 
it seems to bs generally agreed that it is rather marginal (Gimson 
1966:142; Wiik 1965:135). There is also a certain amount of free variation 
between /a/ and /s/, but the oppoiition is always present word-finally 
as in Mij UtMA (Wiik 1961:135). 

In Estonian, toOi there is a subset of four vowels that occurs only 
in unstressed positions (Lehiste 1970:141); but there do not seem to be 
such restrictions in Finnish. It can be suggested, however* that the 
FinnisH vow^l harmony rule serves a similar purpose, since It restricts 
the possible choice of unstressed vowel. Trubetzkoy (1969:236) classified 
vowel Mmony as a case of reductive neutralization, as vowel harmony 



h.-S?!^?^^! 2 quality occurs In unstressed position. Thii 

141-143) presence of secondary stress (see eg. Gimson 196 
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mians that certain vowil oppositions are fully relevant in word- initial 
syllablis only. In Finnish every vowel is capable of gccurring in un^ 
streised posttton* Nevertheless * the distribution of vowels is limited 
by vowel harmony. The initial vowel of the word, which also always 
carries the primary stresii detennines the quality of the following vowel 
according to the word- internal vowel harniony principle. The appearance 
of a back vowel /a o u/ in an initial syllable presupposes a back vowel 
in the following syllable, and a front vowel fi o y/ presupposes a front 
vowel. The phonetically front vowels /i/ and /e/ are neutral to vowel 
harmony i and can occur with either front or back vowels. ^ To show the 
common function of vowel reduction and vowel hannonyj Trubetzkoy sugges- 
ted a concept of 'prominence' that would phonological ly manifest itself 
by stress Or lengths and morphologically (as 1n Finnish) by a 'weight' 
on the initial syllable. 

Accordingly i there is a general tendency to strengthen prominent 
syllableSf or to make th^ perceptually as distinct as possible. This 
manifests itself in morphological, phonological and phonetic features , 
such as lengthening, diphthongizatlon, stress etc. (cf- Schane li73iS8; 
Hyman 1975:208). A complementary tendency is towards weakening of the 
unprominent syllables, which manifests itself in various ways, too: 
reduction, elision, etc. The differentiation of these two kinds of 
syllables is of course a perceptual aid for the listener. The actual 
process is different in different languages. In English, the reduced 
number of vowels is used to create a greater perceptual distance between 
them. The listener can "concentrate" more on the stressed syllables, 
when he knows he has to distinguish only between two vowels in unstressed 
positions. In Finnish, the listener can make predictions about the 
quality of the final vowel only after he has heard the initial syllable. 
The process creates a closer perceptual unity within the word, and It is 
one of the factors that creates word'^internal cohesion typical of a 
synthetic language like Finnish (for a discussion of word-internal ce- 
hision, see Karlsson 1977). 



The above discussion on Finnish vowel harniony is by no means complete. 
Vowel harfflDny has been discussed in detail by eg. Wiik 1975 and Karlsson 



1971. 




PHONETIC ASPECTS OF VOWEL REDUCTION 
DURATION 

It has been often clalmtd that duration is the primary factor of 
vowel reduction (cf, Llndblorn 1963). In English* as well as in varioui 
other languages, stress and duration seem to be positively correlated* 
The actual phonetic manifestations of stress are various. Liebennan 
(1S70;314) mentions changes in fundamental friquency* sound pressure 
levels duration and coarticulation. A stressed syllable has greater 
duration, higher fundamental frequency and lower degree of coarticul* 
at ion. ^ 

In English, it is argued, a stressed syllable is habitually longer 
in duration* This seenis to have been established by several studies 
(eg. Parmenter h Trevino 193S^ House 1961; Klatt 1975). Usually* the 
duration of secondary stressed vowels is slightly shorter than that 
of the primary stressed ones (Wiik 1965n34i Klatt 1975:31 ). The 
positive correlation between stress and duration has been reported to 
exist 1n several languages i in Swedish (Hadding-Koch 1961)* in Polish 
{Jassem 1959), in Russian (Jones & Ward 1969:206) etc. However, the 
rule that lengthens stressed vowels is not strictly ipeaking universal. 
There are languages in which the increase of duration In stressed 
syllables is minimal* or non-existent* and stress manifests itself in 
other ways. According to Lehiste (1970:35) these languages includi 
Czech, Estonian and Finnish. It is obvious that duration in these 
languages is used primarily as a cue for linguistic quantity. Unstressed 
syllables can also be longer than the stressed ones (they are regularly 
longer in Estonian* as Lehiste 1970il31 notes). 

What is the relative importance of duration as a cue for stress and 
thus as a cue for reduction? It has been claimed that duration univer- 
sally is the primary cue for stress (cf. Thelin 1971:145).: The testi' 
of Fry (1965) and Liebennan (1967:30) show that English listeners do 
rely heavily on duration when judging whether the syllable is stressed 
or not. According to Bolinger (1918:111) the primary cue for stress 

^ For a closer examination of parameters of stress* see eg. Lehiste 

1Q7A I - - 3 - - 




is p-.Jvided by pitch- It can be argued (cf. Udefoged 10fi7:4fi) that there 
1s no slnale acoustic evtnt that 1s charactenstic of all stressed syllablts 
in Inglish. It seems pcobable that different cuts are in close interaction, 
and that their importance depends am the previiling situation. 

According to Lindblom (1963;17S0) timing is the pfimtry variable in 
vowel reduction, and articulator^, imprecision or laxness is the consequence 
Of the shortened durition, and can thus be niglected in cortsidering the 
causes of reduction. Natyrally vowels loss their originaJ colour when they 
become very short. According to Delattre (1969:298) there is » different 
"duration-to-obscuration ratio" for each vowel. This means that when the 
duration of a vowel segment is decreased, there are less and less correct 
identifications. The loss of M -colour should occur at c. l.S msecs but 
the loss of /i/ -coiour not until c. 1.25 miecs. 

The above view of vowel reduction has been criticized by eg. Lehiste 
(1970:140) and Nyquist Goes (1974:70). They base their criticisin on the 
fact that there are reduced vowel systems in many languages. If reduction 
is seen as a systefflatical phenomenon, it has to be acknowledged that the 
vowel quality parameter cannot be ignored. Also, unstressed vowels can 
be longer in duration than stressed vowels, and still no de-reduction 
occurs. It seems, though, that the term 'reduction' is again used with 
slightly different meanings. Lindblom's data coniists of Swedish CVC 
syllables, and vowel reduction is what occurs to the vowel when the speech 
rate is increased. Lehiste sees reduction as a phenomenon of language 
system: variation of two vowel systems. While Lindblom's results are un- 
doubtedly correct, it cannot be claimed that duration is the only factor that 
causes reduction, if reduction is regarded in a wider linguistic sense. 

As regards Finnish, it can be said that durition cannot be a decisive 
cue for stress. Stress is fixed in Finnish: main stress falls always on 
the initial syllable and every other syllable receives a secondary stress 
The last syllable is unstressed. Stress is thus predictable and serves 
as a boundary signa,l. The phonetic parameters of Finnish stress have not 
been studied much.' Raimo (1972)1 ^ ^^^^ 

test that increase In intensity does not Increase the identifications of 



8.-9f?J972:'^ "* of Phonetics, JyvBskylM, Finland. 
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stfiss, while a change in pitch seems to InfliJence the impresiion of 
stress. Wiik (1975)^ suggested in a study of word boundaries that 
intensity^ fundamental frequencys duration, and quality can all act as 
cues for stress in Finnish* Sovijarvi (1954) connects stress to rhythm^ 
and argues that rhythms and thus also stress, is manifested mainly 
through variations of intensity. Lehtonen and Koponen (1977i7i) 
include also glottal ization as a manifestation of (sentence) stress- 
No systematic survey of the parameters of Finnish stress has been made, 
and it seems realistic to presume that their relative importance depends 
on the contextual factors ^ as in English* 

Duration cannot be a primary cue for stress, since it is used mainly 
to signal linguistic quantity » Phonologicany single and double vowels 
and consonants are distinguished mainly by their different duration 
(eg. taU "fire" vs. tuM "wind"; tatt "fire" vs, tutU "Customs"). 
The doubling of a segment is not a sign of prominence in Finnish^ 
both single and double vowels can occur in stressed as well as in un- 
stressed positions (eg. tulm "he comes", where the final vowel is un- 
stressed Ee:]). Finns do seem to use the duration cue primarily for 
distinguishing single and double vowels, even if there is also a differ- 
ence in their quality (Lehtonen 1970;86-88). The difference in quality 
between shorter and longer vowels seems to be a kind uf phonetic universal* 
English uses this feature to distinguish between /tense/ and /lax/ vowels* 
while Finnish does not use it to distinguish between /single/ and /double/ 
vowels. Finnish uses duration for dlstlnguiihing vowels, while English 
uses it mainly to signal stress. 

In Finnish the question of duration is, however, more complicated. 
As it was stated, both single and double vowels can occur in the final 
syllable. The actual physical duration of the final vowel is, however, 
influenced by the preceding syllable, especially when the intervening 
consonant is single. Thus both single and double vowels are ihortened, 
if the preceding syllable is long (ie. having a double vowel, a diphthong 
or a vowel - consonant combination). Accordingly, the final vowel is 
usually slightly longer in tulgg "he comes" than in ^u£i£e£ "the wind 
blows", and considerably longer in Cute ''fire" than 1n tuuU " the wind". 

^ A paper read at the Sixth Conferenee of Phonetics, Joensuu, Finland^ 

24.-25.6,1976. 
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There are also four actual duritions for the unstrtssed final li^llable 
of a disyllabic word. In EngHsh the duration doii not depend on the 
prseeding syllable. This muit cause certain problems for Finnish students; 
thay will probably overdifferentiatt the Engliih durations. Wiik (1965n38) 
has noted, that students tend to use these different variants, short and 
half-long. depending on the preceding syllable. The half-long variant is 
used in words like bUti>i and the short in eg. faeatM. Wilk suggests that 
students Should get rid of the half-long variint. The results gained in 
the present study seem to indicate, howeyer, that the half-long variant 
would in fact be the more correct one. The results concerning duration 
are surmarlzed in Tables 1 and 2. 

Table 1. The average durations (msics) in certain word groups 



VI ^ 
tense 

n ^ 
/§/ 


1 
SI 

S2 

E - SI 
E - S2 
SI - 


X ' SD 


C 

X sn 


X ^ SD 


168 53 
171 SO 
153 35 


103 30 
89 33 
100 33 


71 32 
71 39 
74 36 


nen 
non 
non 


non 
non 

1 non 


non 
non 
non 


VI - 
lax 


E 
SI 

S2 

E - SI 
E - S2 
SI - 52 


82 24 
76 18 

83 16 


131 20 
121 25 
118 26 


80 25 
84 ■ 39 
84 32 


non 
non 
non 


non 
non 
non 


non 
non 
non 


VI = 
diphthong 

V2 ^ 
1^1 


E 

SI 
S2 


173 38 
173 40 
179 39 


97 41 

82 35 
79 45 


72 26 
75 40 
80 34 


E - SI 
E - S2 
SI ^ S2 


non 
non 
non 


non 
non 
non 


non 
non 
non 



The table shows the average durations of all vowels in each informant 
group, the standard deviitions (SD) and also the s1gnif"ance oHhe 
d fferences of means (j - almost significant. p<5S;%+ =",g"i?Icant. 
p<2%. +++ = very significant. p<lK-. ++++ . strongly signlfilant. p<0 5S) 
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When the final vowel is /s/* (Table 1) there does noet seam to be 
any great difference bitween the perforniances of English and Finnish 
speakers, Finnish speakers do produci a shorter variant * if the preceding 
vowel is tinse - since Finns intirpret the tense vowel as a double vowel 
and thus the syllable as a long one - but so also do native English speak- 
ers. Finnish speakers do not seem to be affected by the rules of their 
mother tongue. There are various explanations for this seeming lack of 
error. One is the fact that Finnish listeners can in fact hear the final 
vowel as a lengthened one. In a listening test carried out by Wiik (1965: 
138) 15% of the listeners (schoolchildren) designated the final vowel with 
two vowel letters. This happened more often after a tense vowel. This 
shows that a Finn may hear that English speakers use a longer variant than 
he 11 accustomed to hearing in the respective position in his own mother 
tongue. More probably the students were affected by the litter 4 that 
occurred in the orthographic representation of the test words. Many 
students were obviously unsurej whether to pronounce the /r/ or not. 
They may also have been influenced by American English pronunciation. 
However 4 the hesitation, or the production of some kind of [r] may obscure 
the results with this group of words. Yet another factor that can influence 
the results is the statistical method used. In a small sample like this 
the oppoiing tendencies may create a misleading image. Thus it seemed 
that some of the students regularly commit the expected errors, while 
others do quite the opposite. As only the means of each informant group 
were comparedj individual tendencies may have been ignored. Yet it can 
not be claimed that the students had achieved an error-free pronunciation. 
This can be seen more clearly in the results that concern words ending 
with fif^ which have been pfesented in Table 2, 

As it can be seen in Table 2, Finnish students produce regularly 
a significantly (++++/+) shorter vowel than the English speakers* if the 
preceding vowel is tense. This is just what was expected. The unexpected 
thing is that the error seems to be cormitted systematically by the students 
of English, not by the other students. Again, the error is seen only as 
regards the position where the initial vowel is tense. This Is explained 
by the fact that the duration used after a lax vowel matches the ''correct" 
English duration. It seems, toOi that the duration used by Finns for final 
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Table 2. The average durations (mseci) of ctrtain word groups 



ERIC 



VI n 

tinii 

V2 - 
/I/ 


I 

SI 

SI 

E - SI 
E ^ Si 
SI - Si 


X ' SD 


C 

I SD 


X ^ SD 


184 36 

183 44 

184 40 


8^ 34 
69 24 

63 33 


88 26 
SO 21 
70 32 


non 
non 
non 


+ 
++ 
non 


++++ 
non 


VI = 
lax 

V2 = 
Itl 


E 
SI 
Si 


SO 26 
81 23 
86 iS 


111 36 
9i 3i 
101 4i 


88 14 
81 33 
68 36 


E - SI 
E = Si 

s] - Si 


non 
non 
non 


+ 

non 

non 


non 
++ 
non 



/p/ 1s approxifnately coffict, but the duration usid for final /i/ gener- 
ally too short. The average duration of /#/ and HI in the whole material 
IS 78 vs. 88 msecs respectively in Group E, 82 vs. 71 msecs in Group SI 
and 85 vs. 69 msecs fn Group Si. Thus Ftnns produce a shorter /i/, in 
aecordince with the universal tendency for close vowels to be shorter, 
while English speakers produce a clearly longer /i/. The longer duration 
of final /,/ probably serves the purpose of greater distinction between 
the two vowels but many linguists have also analyzed it as a tense vowel 
(eg. Chomsky h Halle 1968). 
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The influence of the preceding syllable upon the final vowel is 
iuniTiarized in Table 3, 

Table 3. A coniparisDn of vowel durations in syllables following lax vs. 
tense vowels of English 





E 


SI 


S2 




/tense/ 
/lax/ 


71 

80 


71 
84 


75 
84 


^ /i/ 


/tense/- 
/lax/ 


non 


non 


non 


/tense/ 
/lax/ 


88 
88 


50 
81 


58 
70 


^ Itl 


/tense/- 
/lax/ 


non 


++ 


non 



It can be seen that because of the various, partly conflicting 
influences, the erroneous production due to the influence of the preceding 
syllable manifests itself in certain positions only: in words ending with 
l%l and in only one infomiant grDup. 

On the whole, there are two tendencies that can be detected In the 
pronunciation of Finns. One Is the Influence of the preceding syllable. 
The other is the tendency to use a duration that is too short. The 
latter seems to be due to a wrong interpretation of the timing rules of 
English, The vowels are usually shortened when unstressed. Some of 
the Finnish students seem to have a rule that shortens all unstressed 
vowels, regardless of their position. English speakers have a rule 
that lengthens final vowels^ ^ and this rule seems to overc^e the 
shortening rule (cf. Wlik 1965:134; Oelattre 1959:292). Thus for 
example the average duration of a lax VI in words ending with /e/ was 
82 msecs, while the duration of V2 was 80 msecs. The duration of the 
unstressed vowel was thus practically equal to the duration of the 

" Final lengthening seems to be a universal phenomenon^ see eg. Lindblom 
1968; Nootebooffl 1972, 
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strissid vowel. It can be evin longir.aa the dupatlens of vowels in words 
ending with tzl showi the average duratipns of a lax VI was 80 msecs while 
the average duration of VZ was 88 msecs. 

QUALITY 

One of the factors of vowel reduction Is said to be the centralized 
quality of the reduced vowels, ie. the close association with acoustical 
neutral vowel. In the following the reduced vowels of English are 
considered from this polnf of view. The quality of the stressed vowel Is 
often called a full vowel quality. It cannot, however, represent a target 
value of vowel since also stressed vowels are influenced by the environmint. 
Tense vowels represent more characteristically the full vowel quality, 
since lax vowels are Inherently more reduced, or closer to niutral vowels 
In both English and Finnish (cf. Stetson 1951:43^ Wllk IgfiSiSS). The 
Interrelation of tense and lax vowels 1s shown In Table 4, 

vowlls^' " * frequencies of certain tense and lax 







1 


F2 


F3 




/1i/ 


Itl 








Itl 


E 

SI 

S2 


300 
370 
320 


440 
390 
360 


2340 
2220 
2160 


1900 
2080 
2170 


2850 
2700 
2880 


2600 
2660 
2770 




ImM 


M 


NM 


M 


/u:/ 


M 


E 

SI 

S2 


380 
370 
410 


460 
430 
41i 


1140 

830 
820 


980 
980 
840 


2400 
2610 
2730 


2430 
2465 
2550 




/O.'/ 


/o/ 


/Of/ 




/O;/ 


10/ 


E 
SI 

%1 


375 
470 
500 


SSO 
520 
490 


690 
890 
940 


1010 
1100 
1090 


2S60 
2475 
2575 


2550 
2460 
2600 




laM 


/a/ 




Ul 


laif 


1^1 


E 
SI 

Si 


650 
590 
710 


700 
550 
720 


107S 
1130 
1240 


1290 
1260 
1280 


2560 
2360 
2500 


2560 
2450 
2450 
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On the whole, the effect of cintral ization seems to b& greater in 
Group The students in Group SZ seem to use nearly Identical qualities 
for tense and lax vowels. It seems that English speakers realize the 
laxness of vowel mainly In terms of openness. Thus for high vowels Fl 
is raised and FZ is lowered. For non-high vowels both Fl and F2 are 
raised. Finnish speakers, on the other hand, seem to be affected by the 
ffontness of the vowel: they raise F2 for back vowels and lower n for 
front vowels. It may be noted that lax vowels may not always be literally 
closer to a neutral vowel. Thus, for example, all groups raised Fl when 
producing a lax A/^ ie, the value of Fl was getting further from that 
of the neutral vowel. The above results are also shown as fonnant charts 
{Figure 1) for a better illustration of the processes discussed here. 

Figure K The formant positions of certain tense (— ) and lax ( — -) 

J-f Jh^^''^ 2^ produced by English speakers (E), students of English 
(SI) and other students (SZ). . _ a . i 
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500 
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A crucial qugstion In the discussion of the centralization is the 
Starting point, or the point of comparison. From what poiltlon are the 
vowels centralized? For /i/ we can define a lax average value* but It 
is more difficult for /e/. In the following the question Is considered 
In the light of some textbook examples. Usually /a/ has been described 
as obscured, murmured, dark, etc. Artlculatorily it has been described 
as having a central tongue position and neutral lip rounding (see eg. 
Gimson I956m8-119' Lehtonen, Sajavaara and May 1977:1 17-118^ Stubelius 
1968:42). Acoustically, Its foniiant values are near to those of a neutral 
voweh It could be suggested that in English /g/ is an independent^ 
systeniatlc vowel, not simply a variant for all unstressed or sloppily 
pronounced sounds. It seems to be related to the stressed /a^/ since 
they have arisen from similar contextual envi ronments = the following /r/ 
has originally caused a centralization of certain vowels* and when /r/ 
has disappeared, the quality has persisted. Whether the phoneme /§/ 
Includes vowels derived from other sources is somewhat unclear. The status 
of endings like -oi, etc, seems to be slightly ambiguous, and there is 
probably some variation due to presence vs. absence of secondary stress. 
The phoneme /a/ has several allophones^ Wlik (1965:136), for Instance, 
gives seven different variants and Jones (1967-92=93) thre*j. Usually 
it is agreed that the final variant is more open. 

As regards the quality of English /s/, it can be presumed that Finnish 
students either tend to use the vowel quality Implied by orthography , 
or tend to use the nearest equivalent 1n the Finnish vowel paradigm, 
viz. /o/. The influence of orthography Is not considered here, but It 
is undoubtedly considerable. Here all test words were chosen to be "easy", 
SO that the students h^d to know only some obvious rules of pronunciation* 
The substitution of /B/ for /#/ is, according to Enkvist (1963:47}, 
a typical error of a Finnish student of English. The English vowel 1s, 
however, rather remarkably different from the Finnish one: /a/ is central, 
while /bV 1s front, /a/ is a little lower than /bV and, above all, it has 
a neutral lip position while /b/ is rounded. The average formant values 
of the vowel /g/ in this material are shown in Table 5. 
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Table 5, The average fonmant positions of V2 ^ /i/ of three word grouDS 





Fl 


FZ 


F3 


E 


480 


1480 


2600 


SI 


470 


1510 


2400 


SZ 


470 


1460 


2390 



The tendericy of Finns to have generally lower formant friquencies 
Is clearly reflected 1n the results. The F3, espiclally, is consistently 
lower with Fi^^nlsh speakers {the lowering is attested by the statistical 
significance: ++++). The behaviour of F3 probably reflects the fact 
that Finns produce a rounded vowel. It could be suggested that students 
might be told to use a more ta]-11ke quality In final position at least, 
in order to avoid the lip rounding (cf. ErSmetsS 1951), 

As regards the quality of unstressed, final ft/-^ the comments In 
textbooks are contradictory. Jones (1967:68) and Johansson (1972:21) 
argue that final /:/ 1s particularly open, or closer to /e/, Wlik's 
(1965:135-138) r#:..lts show that final /i/ is more closed, or closer to 
/I:/, aimson {1956:97) notes that there is a lot of variation in the 
pronunciation of final /i/: It may be centralized, or it may be replaced 
by /i:/. The results of the present study show that all English informants 
preferred a quality that is more close than open. The results' are shown 
in Table 6. 

Table 6, The average formant positions of V2 ^ /i/ of two word groups 



^ /tense/ 






Fl 


■ 








F£ 


F3 




E 


325 


2120 


2645 




SI 


385 


1980 


2565 




S2 


395 


2070 


2720 


E 


- SI 


++++ 


++++ 


non 


E 


- %2 


++++ 


non 


non 


ST 


- S2 


non 


non 


+++ 



V, - /lax/ 



Fl 


F2 


F3 


350 


2155 


2670 


385 


2020 


2600 


355 


2060 


2725 


+++ 


++++ 


non 


non 


+ 


non 


+ 


non 


+ 



2B 



- 33 - 



In both groups students of English have a signif icintly lower F2 
than native speakers, and also significantly higher Fl. The students of 
English have been taught to use a different quality for tense and la^ 
vowels, and thus they use the lax Quality they have learned in final 
position as well* English speakerSi however, use a quality nearer to 
the tense vowel, as can be seen in Table 7. 

Table 7. A comparison of formant positions Of Stressed /i:/^ stressed 
A/, and unstressed /i/g 





/i:/ 








E 


300 


440 


.^40 




SI 


370 


390 




Fl 


S2 


320 


360 


380 




E 


2340 


1900 


2140 




SI 


2200 


Z080 


2000 


F2 


S2 


2160 


2170 


2070 




E 


2850 


26on 


2660 




SI 


2700 


2660 


2590 


F3 


S2 


2880 


2770 


2720 















English speakers use a quality for the final /l/ which lies between 
the average values of stressed /i ' and stressed /i/. It seems that Finnish 
students tend to produce a vowel that is too "tense'' when stressed and too 
"lax" when unstressed. 

As we have seen, there is a tendency for /a/ to drift towards a fnore 
open position, and for /%/ towards a more closed one, when they are in 
final position. This is what one would expect precisely in final position, 
since the strongest need for retaining the opposition occurs Just there. 
Since eg. -m and -y are both productive endings, it Is natural that the 
distinction between them Is preserved. 
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COARTICULATION 

It has been argued that one of the factors that causes reduction 
is greater contextual Influance* or coartlculatlon. On the level of 
speech production, speech sounds are not the Invariant entities that 
they are on the level of perception. Speech flow cannot be segmentated 
unambiguously into linguistic units of any kind, eg. phonemes (cf, Kent 
and Minlfie 1977:116). The sounds Influence each other and a phonetic 
feature may be expanded to several segments not originally having this 
feature. Thus coartlculatlon can be defined as "the influence of one 
speech segment upon anotheri ... the influence of a phonetic context 
upon a given segment" (Oaniloff and Hantnarberg 1973:239). 

The fact that coartlculatlon exists implies certain things about 
the nature of the higher-level progranriing of speech, ie. the neural 
events connected with speech processing. As the acoustic and artlculatory 
events seem to be of variant nature, while the perception of speech occurs 
on the basis of invariant linguistic units, a model for speech processing 
should be able to explain how the discrete linguistic units and the 
physical speech continuum are made compatible within the speech programming 
system. 

Basically coarticulatlon can be either forward or backward. Forward 
(anticipatory, regressive, right-to-left) coarticulation occurs, according 
to Kent and Minifle (1977:117), when articulator^ adjustment for a segment 
Is started during an earlier segment. One example of forward coarticulation 
is anticipatory lip rounding in French, studied by Benguerel and Cowan 
{19?4), They found that upper lip protrusion for a rounded segment can 
start as much as six segments In advance, eg, already on the initial 
consonant in the combination /strstry/. An anticipatory nasalization 
in English is discussed by Moll and Danlloff (1971), who argue that nasal 
coarticulation can occur across the word boundary. 

The other type of coarticulation is backward or retentive, progressive, 
left-to^right coartlculatlon. Here an artlculatory adjustment for one 
segment is carried over to a later segment (Kent and Minifle 1977:117), 

Retentive coarticulation can be attributed to inertial factors, 1e, 
to the mechanical slowness of the artlculatory mechanism. Thus no hypoth- 
iSis about the hlgher^level prograrmiing need be made. Anticipatory 
coartlculatlon is harder to explain If neural factors are not considered. 
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A physically later Pigment can not influence an earlier segment unless 
the system somehow hai information concerning several segnients simul- 
taneously. This gives raise to the question of the size of the speech 
Pfogranniing unit. Kozhevnikov and Chistovich (1065) have expla^^ned 
anticipatory coarticulation by determining that the basic speech prograrrmihg 
Unit is a syllable. In their model the minimal unit is an open syllable 
(C)'V. The articulatory events are ofganized in sy t lable-sized chunks. 
The production process is di scontinuous i there is a boundary after a vowel 
sound. The vowel detinriinis the preceding consonants, and thus explains 
anticipatory events. 

According to the binary model discussed by eg, Moll and Daniloff (1971) 
speech is programmed in segment-sized units which are given a binary speci- 
fication (+, or unspecified). The anticipatory coarticulation is 
explained by a scanning device, which looks upon the next specified segment 
iii the sequence^ and gives the same specification to all preceding unspecifle* 
segments. There seems to be evidence to show that neither the Kozhevnikov 
and Chistovich theory nor the Moll and Daniloff model is able to explain 
all coarticulatory phenomena. Thus, for example, coarticulation can occur 
across syllable boundaries (Moll and Daniloff 1971; Kent, Carney and Severeid 
1974; Dhman 1966:151-152), which shows that the model proposed by Kozhevnlkov 
aiid Chistovich cannot account for all cases of coarticulation* Ihe binary mofli 
of Moll and Daniloff has been criticized eg. by Kent, Carney and Severeid 
(1974). It will presumably be necessary to postulite more than one unit for 
the speech programming model (for a closer discussion on the relevance 

□f coarticulation to speech prograrTining models, see Kent and Minifie 

1977). 

One of the most crucial ques: am the point of view of the present 

subject is the notion of target, :on of coarticulation does not 

make sense* unless some kinds of inm r^ist properties are postulated for the 
segment. How can the inherent properties be defined; acoustical ly» articu* 
iatorlly. or in sonie other way? Ohde and Sharf (197Si923) presume that a 
target is an ideal articulatory position of the vowel produced in Isolation. 
Lindbloni argues (1963:1778) that target represents **soffle physiological 
invariancr. For f%cNeilage (1970:190) target is an invariant articulatory 
(motor or spatial) goal. Kozhevnikov and Chistovich (1961) use the temi motor 
goal: these are stored in the braln^ and actual motor cofmiands issued depend 
both on the motor goals and the state of the articulatary mechanism. 
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Are targets related to phonanes or some other linguistic entities? 
Most writers prefer phonemes, but eg, Wlckelgrin (1969) has proposed an 
ariophonlc model. As all possible allophones are listed and stored in 
the brain, his model would presuppose a vast prografmilng capacity. The 
model Ignores also the fact that many coarticulatory phenomena can be 
best described by generalized rules, not as single unreTated events. 
According to Hairmarberg (1976), who thinks that coarticulation is a 
totally linguistic process, governed by low-level phonological rules that 
link phonology and phonetics together, a speaker actually hits the tarqet 
when producing a coarticulated form. Thus the speech mechanism does not 
miss the target, but does what it is told to do by the brain. Perhaps 
some justification for HarriTiarberg ' s views is offered by the fact that 
coarticulatory rules are not universal: if coartfculation could be 
accounted by the physiological factors, the rules would have to be of a 
more universal nature. It seems to be probable, 1n fact, that coarticulation 
rules are to some extent dialectal, or idiolectal (cf. eg. Bladon and 
Al-Bamerni 1975; Kent, Carney and Severeid 1974), 

What is the influence of coarticulation upon vowel reduction? It has 
been argued that reduced vowels coarticulate more strongly with the 
surroundings (Nord 1974:149^ Tiffany 1959; Shea™ and Holmes 196U 
Stevens and House 1963). The surrounding consonants do have a centralizing 
effect upon vowels In general, as was confirmed by Joos (1948), The place 
of articulation of a consonant affects the vowel, too, as was noticed 
within the framework of locus theory (eg. Cooper et al , 1952; Delattre 
et aK 1955), Locus theory has been criticized by eg. Dhman (1966:155) 
who argued that the transitional pattern of a vowel does not depend entirely 
on the consonantal context, and that vowels, also, may influence other 
vowels. The average consonantal effect is, however, according to Delattre 
(1969:299-300) the shift of the F2 of the vowel towards the locus of the 
adjoining consonant. As regards the direction and causes of consonantal 
Influence upon the vowel, Ohde and Sharf (1975:925) give three possible 
models. The first is an anticipatofy scanning model: the vowel is modified 
to approach the place of articulation of the following consonant. The 
second is a retentive mechanoMnertial model: the rapid shift from a 
consonantal to a vocalic position is prevented by inertia. The third 
model is a retentive compatibility model: the vowel is modified to approach 
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the place of articulation of thi following conaonint. According to Ohde 
and Sharf vowil reduction {herg; the consonantal influence upon voweli) is 
caused niainly by the retentive models: a vowel in a CV syllable would be 
more apt to reduce than a vowel in VC syllable. Data supporting this 
hypothesis is offered by the study of MacNeilage and DeClerk (1969) which 
showed that the coarticulation in CVC structures was stronger between the 
segments in CV part of the item than in the VC part. 

The quality of vowels can also be affected by their position: eg. 
following or preceding rest position. According to Nord (1975:153-154) 
final unstressed vowels coarticulate strongly with the rest position of 
the vocal tract. Thelin (1971 1 149) notes that in Russian \owel reduction 
can be caused by a following or preceding rest position, Ohman's (196S: 
154) study of Swedish vowels shows that in some vowels the forniant fre- 
quencies are more central in final positicn than in initial position. The 
final reduction can be explained either as the influence of the preceding 
vowel (cf. Ohde and Sharf 1975) or as the influence of the following state 
of the vocal tract (cf. Nord 1975), and the two-fold influence is probably 
the reason why final vowels often tend to reduce. 

The quality of vowels can also be influenced by other vowels. The 
overall vocalic influence in VCV structures was studied by Ohnian (1966) 
whose results indfcate that speech production is in fact a series of vowel 
gestures with intervening consonants. In another study of VCV structures 
Fant et aK (I970:in) argue that the course of F2 within the final vowel 
seems to be depending on the quality of the initial vowel. According to 
Daniloff and Haninarberg (1973:244) the effect of the preceding vowel upon 
the final one is the adjustment of openness: an open VI causes the VZ to 
be more open. The vocalic coarticulation can thus reach across a syllable 
boundary. 

The final vowel can thus be influenced by the preceding vowel, the 
preceding consonant, or by the following rest position. The aim of this 
Study was to study the vocalic Influence, The Figures 2=3 show the formant 
positions of certain tense and lax vowels and the formant positions of the 
following vowel /i/. The numerical values of the formants are shown in 
Table 8, 
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V2 = /9/ 



VI 


I 


e 


m 


\r 


Q 


A 




E 
SI 

%2 


480 
490 
460 


480 
480 
460 


120 
500 
470 


480 
460 
490 
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460 
450 


470 

eon 

470 


Fl 


E 
51 

S2 


1500 
1640 
1S90 


ino 

1590 
1570 


1500 
1610 
1580 


1490 
1520 
1390 


1420 
1520 
1425 


1430 
1530 
1500 


F2 


E 
SI 

S2 


2670 
2470 
2450 


2640 
2450 
2460 


2650 
2510 
2460 


2660 
2430 
2410 


2400 
2360 
2350 


2640 
2490 
2480 


F3 
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V2 ^ >V 



VI 


i > 




a; 








E 
SI 

%l 


440 
430 
400 


510 
410 
450 


480 
490 
450 


450 
470 
480 


470 
490 
510 


Fl 


£ 

SI 

52 


1560 
1670 
1130 


1500 
1530 
1570 


1470 
1460 
1480 


14S0 
1450 
144C 


1460 
1475 
1440 


F2 


E 

SI 

SZ 


2680 
2400 
24S0 


2700 
2600 
2390 


2660 
2350 
2370 


2410 
2390 
2340 


2650 
2425 
2425 


F3 



38 



ERIC 



- 42 - 



It can be seen that there is a slight attraction towirdi the preceding 
vowel qusllty m each informant group. The influence of the initial tense 
vowsl upon thi final vowel is eonsidirabli In the group of other studBnts 
m. There seem to be two areas of attraction: after a front vowel the 
final vowel is more front and high, and after a back vowel the final vowel 
IS more retracted and low. The initial lax vowel iosms to exert a slightly 
different Influence: the decisive factor stems to be the behaviour of F2 
while Fi is kept relatively constant. Table 9 shows the effect of certain 
vowel dimensions upon the final vowel. 



Table 9 
to the 



. 'he formant frequincles of the final /a/ classifiid accordina 
naturi of the Initial vowel 





V 


2 ^ 








E 


SI 


S2 




high 

mid 

low 


445 
490 
480 


450 
470 
490 


440 
480 
450 


Fl 


front 
back ^ 


475 
470 


440 

485 


425 
485 


Fl 


front 
back 


1530 
1465 


1600 
1475 


1550 
1455 


F2 


labial 
1 11 ablal 


2550 
2680 


2410 
2450 


2380 
2400 


F3 




^"■^^^ firoup 
Vpla?^^^^ 


E 


SI 


S2 




high 

mid 

low 


480 

485 
495 


495 
470 
500 


465 
455 
470 


Fl 


front 
back 


495 
480 


490 
490 


465 
470 


Fl 


front 

back 


1505 
1445 


1615 
1525 


1580 
1435 


F2 


labial 
illablal 


2530 
2650 


. 2395 
2480 


2380 
2465 


F3 
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The table shows that English spiakers tend to raise Fl slightly. 
If the pficedlng vowel is non-high, Finnish speakers tend to lower Fl If 
the initial VDwel is front. FZ displays more variation. If the initial 
vowel is back, all informants lower FZ. It also Siime that F3 is affected 
by the initial vowel i after a round vowel it seems to be lowered. For 
Finnish speakers the effect is very slight. When the initial vowel is 
lax, the results are slightly different. The effect of the initial vowel 
on the Fl of the second vowel is small, F2 is, however, clearly raised 
after frQnt vowels. The average value of FZ of the final vowel is signi- 
ficantly smaller (+++) when the proceding vowel is back 1n two Infortnant 
groups^ English speakers and Students (52), As the behaviour of the 
second formant seemed to be of some Significance, the correlation between 
the second formants of both vi and V2 were also calculated. The results 
are seen in Figures 4-6, The correlation was calculated separately for 
tense and lax vowels, as their influence seemed to be different. 

Figure 4. The correlation between FZ of the initial vowel (Vi) and FZ 

of the final vowel (Vg) shown as a diagrami native speakers of English (E). 
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Figurt 6. The corrilatlon between F2 of the initial vowel (Vi) and ¥2 
of the final vowel (Vg) shown as a diagram; Finnish students (S2). 
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The data for English speakers shows that there seems to exist a posi- 
tive cofrelation between the second fomants of initta? and final vowels. 
The correlation is stronger when the preceding vowel is tense {r=0.i7 with 
tense Initial vowel; with la^ initial vowel). Students of English 

(SI) appear to have the same tendency. There is a stronger correlation 
when the Initial vowel is tense {r-0.60 vs. r^O.46). Other Finnish students 
have a stronger correlation between the vowels when the initial vowel is 
lax (r^O.58 vs. r^O.34). One Important fact should be noticed. There 
seemed to be a substantial amount of individual variation as regards the 
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coarticulatory influence between the two VDweli, although some facts 
appeared to be characteriitic for British speakeri, some others for Finnish 
speakirs. Further, one his to be eareful fn drawing any conclusions from 
the coarticulatory processes in Finnish, sinee the material consisted 
Qnly of English items. The differences in the productions of native 
speakers and Finns may reflect only the fact that Finns are speaking a 
foreign language. It Is quite possible that the features of their 
"interlanguage" are different from English and frotri Finnish. Also, the 
material Includes vowel combinations* which, If interpreted 1n terms of 
Finnish phonology, are strictly ungrammatlcal i eg. poUm - +poto. The 
above results only apply to /a/. The Influence of the initial vowel upon 
/X/, however, proved to be rather similar. The summary of the Influence 
Is given In Table 10. 



tf iL^nitJS^J^fh'"* ^^«3y^"Ctes of the final /I/ classified according 
to the nature of the preceding vowel 





Vg 




E 


SI 


SZ 




high 

non=h1gh 


310 
340 


365 
410 


370 
410 


Fl 


front 
back 


i1§5 
ilOO 


2055 
1920 


2060 
2080 


n 


labial 
11 labial 


2596 
2680 


2530 
2590 


2680 
2750 





/t/ also seems to be influenced by the openness of the preceding vowel, 
Fl 1s higher after a non^hlgh vowel, and F2 Is raised after a front vowel. 
Finnish students do not have this tendency, however. If the Initial vowel 
is rounded, there is a slight lowering tendency. In general, the results 
are in accordance with the data from /a/. 

Summing up the results for coartlculatlon, 1t may be argued that 
certain vocalic dimensions can be retained through the whole VCV structure. 
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Thi initial vowel seems to detirmine the quality of the final vowel to 
SDme extent. It Is alio possible that stressed vowels tend to deternilni 
the quality of the unstressed ones, This could not, however^ be studied 
With this material . 

CONCLUSION 

The aim of the present ^tudy was to investigate the difficulties 
met by Finnish students when producing unstressed vowels of English. 
Jhe 5C0pe of the study was limited to final vowels. Phonological ly, 
Finnish and English differ radically. It was noted, however, that there 
is a tendency in many languages to neutralize the oppositions between 
vowels In unprominent position. It was suggested that in English this 
tendency 1s realized in tenns of vowel reduction, and in Finnish, partly, 
in terms of vowel harmony. 

The phonetic investigation included the measurement of duration, 
vowel quality, and the degree of coarticulation between the vowels. As 
regards duration, it may be noted that students tend to produce a final 
vowel that 1s too short, or that they tend to use two different durations 
depending on the length of the preceding syllable. The differences In 
vowel quality were also considerable between English and Finnish speakers. 
Generally, Finnish speakers tend to produce a rounded vowel for /a/ and 
an /e/-l1ke, or too lax vowel for the unstressed /j/. The last error may 
be due to the fact that Finnish students exaggerate the difference bet= 
ween tense and lax vowel qualities. It may also be due to the fact that 
they use the same quality for l%l in every position, whereas the English 
speakers had different qualities for final and non^final variants. The 
results from the measurement of coarticulation are necessarily very tenta- 
tive. It would seem, however, that a positive correlation between the 
initial and the final vowel can be established. Thus the initial, or 
stressed vowel would seem to have an influence on the final one. The 
exlstince of such Influence is common to each informant group. The actual 
manifestation may be different. It may be noted again, that though the 
coarticulation betwien the stressid and unstressed vowels may be universal, 
the rules may be language-specific, dialectal and Idiolectal. 

How can the results be applied in practice? It is difficult ^ perhaps 
impossible ^ for an adult to achieve a native-^like pronunciation of a new 
language. That is why the adult teaching should emphasize the cognitive 
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sidt of languagi liarntng. What cannot bi achliVid by Imitatloni can 
perhapi be acKlived by underitandlng theoretical ly a particular process ^ 
eg* the variation in the duration of vowelt, or the phonetic difference 
between Finnish /B/ and English /§/. This may noti however, apply to 
coafticulatory rules. They may be learnt best by listening how a native 
speaker "sounds". It may also be unnecessary to learn them, or some of 
them. 

Finnish students seem to have difficulties even in fairly simple 
structures of Inglish. This concerns both the duration and the quality 
of segments. The main function of vowel reduction seems to be the correct 
adjustment of the prominent and unprominent elements in a sentence. If 
this is ignored, the fluency of the whole utterance is jeopardized. 
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THE ENGLISH /ptk/-^/bdg/ DISTINCTION: DATA AND DISCUSSION 



Karl Sua mi 
UnlvQhbitij Data 



1. Acoustie Correlates 

The major acoustic correlates of the English /ptk/- 
/bdg/ distinction are well known and uncontroversial and 
therefore we shall only briefly mention the characteristics 
of the two sets in the word initial, word medial and word 
final positions, without inaking explicit reference to 
published data. We shall assume that the word initial and 
final positions are at the same time also utterance initial 
and final, respectively* The effects of stress, emphasis* 
position in the sentence etc* will be excluded although, 
admittedly, they interact with the more segmental de- 
terminants of the opposition. Temporal differences between 
the two sets will be dealt with separately in section 3.5. 
below. 

in the word initial position the distinction is main- 
ly manifested by a presence of aspiration in /ptk/ vs. 
its absence in /bdg/, both sets normally having voiceless 
occlusions, although in the latter the voicing of the 
occlusion is optional. In addition to having aspiration 

a lag in the onset of voicing /ptk/ are also charac- 
terized by a stronger (homorganie) explosion noise than 
/bdg/. The difference in the timing of the voice onset 
causes also other concomitant acoustic effects, e.g. the 
more marked rising of the first formant (Fl) of the ad- 
jacent vowel in the case of /bdg/ and the first fonnant 
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eutback In /ptk/, and the fact that in general a larger 
portion of the formant transitions are voiceless and 
masked by aspiration noise in /ptk/ and therefore perhaps 
less useful as identification cues for the /place of ar- 
ticulation/ feature than in /bdg/ in which the formant 
transitions are voiced and more clearly audible (al- 
though this is counterbalanced by the stronger homorganic 
explosion burst in the former) . It remains to be said 
that the initial pitch of the following vowel is often 
higher for /ptk/ than for /bdg/. Some of the differenees 
just mentioned accompany the distinction, with appropri- 
ate modj f ications I also in the other positions. 

Medially, between vowels ^ /bdg/ are usually voiced 
throughout the occlusion and the explosion burst is weak. 
In /ptk/ the occlusion is voiceless ^ followed by a strongs 
er explosion burst, the duration of aspiration depending, 
among other things, on the proximity of stress* 

Word finally the two sets are mainly distinguished 
by the duration of the preceding vowel i before /bdg/ the 
vowel has a much longer duration then before /ptk/. The 
voicing of the occlusion of /bdg/ in this position is 
optional, ranging from no voicing to full voicing, al- 
though excessive voicing in this position is rare* The 
occlusion in /ptk/ is voiceless. If the stop is released, 
the burst is again stronger in /ptk/. 

2. Stop Production and the Mechanisms of Voice 
Hegulation and Aspiration 

2.1. Voiced Stops 

According to the myoelastic-aerodynamic theory of 
phonation (see e.g. van den Berg 1968, Ladefoged 1912, 
1975, SQderberg 1976) two conditions must be fulfilled 
for vocal fold vibration to occuri firstly, the vocal 
folds must be appropriately positioned (in the slightly 
adducted position, the so-called voice state, see e.g. 




L«defog«d 1973), and seeendly, a sufficient transglottal 
flow of air must ha provided. Given the fulfilment of 
theie two conditions, the applioation of the Bornouill 
effact will cauie spontaneous vocal fold vibration. The 
presence vs. absence of a relatively free passage of air 
through the supraglottal cavities of the vocal tract is, 
of course, the basis for the division of consonants into 
reoonants and obstruents, respectively. The aerodynamic 
conditions prevailing in the vocal tract in obstruents 
(and particularly during the occlusion of stops) being 
antagonistic to the transglottal flow of air and thus to 
voicing, how can the occurrence of glottal vibrations 
during the occlusion of /bdg/ be explained, asauming the 
oorreotnesB of the myoelastic-aerodynamic theory? 

The problem, of course, consists of finding the 
mechanisms that are used to maintain a sufficient trans^ 
glottal pressure drop while the vocal tract, from the 
lungs to the constriction, forms a closed system with 
essentially two oavitlos, the subglottal and the supra- 
glottal one, connected by a valve, the glottis. Rothen- 
berg (igfia. 91) ha, calculated that, without any sfeecl^i ■ 
adjustments affecting the volumes oS the two cavities 
the equalization of the Initial pressure difference would^ 
occur in about 4 msec and that, allowing for a passive ' 
expansion of the pharyngeal walla, a prolongation of 
voicing up to 20-30 msec after the beginning of the 
occlusion could be accounted for. However, more powerful 
mechanisms must be looked for to ...plain the often longer 
voiced closure durations of /bdg/. 

Perkell (1965) conducted X-ray motion picture 
studies that indicated a greater pharyngeal cavity width 
during the articulation of /d/ as compared to /t/ in 
English. This finding was interpreted by Parkell {and 
later by Chomsky and Halle 1868: 325) to Imply that in 
the production of /t/ the walls of the vocal tract are 
rigid as a result of muscular tension whereas during /d/ 
the walls are lax and able to expand to permit the 
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inereaBe of volume necessary for a prolongation of voice 
during the oooluBlon. Thus, according to Perkell, the 
expanBlon of the pharynx In /d/ would occur passively, «s 
a result of the vocal tract being compliant. 

Kent and Moll (ISii, 1552 1, using cinef luorographic 
films, found out that /bdg/ (in Inglish) are oharaoter- 
Ized by a lower hyold bone poiltion and « greater tongue- 
. pharynx width than /ptk/. It was further discovered that 
there was an actual increase in the volume of the supra- 
glottal cavities during /bdg/, whereas there were smaller 
increases, no increases at all or a slight reduction in 
cavity siie during /ptk/. However, the authors prefer the 
■xplanation that pharyngeal expansion during the voiced 
stops could occur as a result of an active mechanism 
(p. 1554). They point out that the changes in vocal tract 
compliance proposed by Perkell are not very likely in 
View of the fact that the intraoral pressure is every- 
where reported to be higher for /ptk/ than for /bdg/,. 
at least for the initial portions of the occlusion (for 
a discussion of intraoral air pressure data see section 
3.2. below). With regard to the intraoral pressure alone, 
then, one would rather expect the pharynx walls to he 
Wider apart and the oropharyngeal cavity volume to be 
larger in /ptk/ and not in /bdg/. Similarly, it can be 
argued that the tensing of the cavity wall, during the 
occlusion of /ptk/ (which, as the data diseussed below 
Indicate, is a claim not very strongly supported by ex- 
perimental evidence) resulting in a smaller volume of the 
oropharyngeal cavity would have to be so strong as to 
overcome the expansive influence of the higher intraoral 
preaaurg * 

Electromyographic investigations of the muscles in- 
volved in pharyngeal cavity expansion (Ball-Berti 1975, 
Bell-Berti and Hirose 1975) reinforce in general the al- • 
ready familiar pattern, i.e. /bdg/ are characterised by a 
pharyngeal cavity expansion while their voiceless cognates 
are not. Mora spaoifisally , the authors try to assess 
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whether a passive or an active expaniiion takes place in 
the voiced atops. However, their interpretation of the 
terms 'active' and 'passive- differs, it seems to me, from 
the use of these terms by Rothenberg. in Rothenberg (1968) 
a passive expansion is one that occurs as a result of the 
influence of the steadily growing intraoral air pressure 
on the compliant structures of the oropharyngeal cavity, 
without the interaction of any muscular activity of the 
structures themselves. In Bell-Berti (1973) , on the other 
hand, a passive expansion is one that results from the re^ 
laxation of the activity of certain pharyngeal wall mus- 
cles, while an active expansion is one that results from 
an increase in the activity of certain other muscles. Thm, 
in this framework, the possible influence of the intraoral 
pressure does not come to play at all. Bell-Berti found 
out that, within the framework used, it is not possible to 
predict whether a given subject will exercise active or 
passive control, or a coitoination of these, for variatione 
in the volume of the supraglottal cavity for voicing dis^ 
tinctions in English^ This finding Bpll-Bcrti inLbipr^Ls 
to show that the fgaLure It tensej , propounded by Chomsky 
and Halle (1968) , is inadequate for describing the phar^ 
yngeal volume changes concomitant with voicing distinc^ 
tions. The electromyographic investigations of Minifie 

li- (1974) showed an apparent inhibition of pharyngeal 
constrictor muscles in /b/ and cQnsiderably less reduction 
for /p/. The authors mention the possibility to interpret 
this finding to support the position that the transglottal 
pressure differential is actively controlled via changes 
in the oropharyngeal volume, these changes contributing 
to the maintenance of vocal cord vibration during the 
occlusion of voiced stops. But the authors are cautious to 
point out that to document an active enlargement of the 
pharynx one would have to observe reciprocal activity in 
the muscles antagonistic to the pharyngeal constrictors. 
However, if we reinterpret the findings of Bell^Berti and 
Minifie et al*, defining, in aecoMance with Rothenberg 
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(1968), passive expansion as one caused by intraoral 
pressure alone and active expansion as one involving ad-- 
ditional muscular activity of any kind, whether a relaxa-- 
tion or inhibition of some muscles or an increased activi= 
ty of some other muecles* we can conclude that theg© re- 
sults give further support to the hypothesis of an active 
expansion of the supraglottal cavity in /bdg/ although, 
as the result obtained by Bell-^Berti Indicate, the actual 
mode of muscle activity is complex and may vary from per- 
son to person. 

On a priori theoretical grounds, too, the hypothesis 
of a passive expansion (i.e. one caused by intraoial 
pressure alone) seems less attractive as it implies yet 
another obstruction to the transglottal flow of air while 
the problem, as was stated at the outset of the present 
section, is to find the mechanisms that facilitate the 
prolongation of the transglottal pressure differential. 
At any rate, the relative volunies of the subglottal and 
supraglottal cavities put a limit to the effectiveness of 
even an active use of this mechanism if not accompanied by 
additional, or concomitant, mechanisms. 

Another possibility of prolonging the transglottal 
air flow is to lower the entire larynx. It may be pointed 
out that this mechanism, in contrast to the ones discussed 
so far, could ensure sustained phonation even if the sub- 
glottal and intraoral air pressures were equal at the be^ 
ginning of the occlusion. The larynx at large can act as 
a piston, decreasing the volume of the subglottal cavity 
and increasing that of the supraglottal one, thus creating 
a transglottal pressure drop. This mechanism is in fact 
similar to the one used in implosive stops although these, 
in contrast to the English Stops, involve a glottalic 
ingressive airstream mechanism. 

We have already mentioned the finding of Kent and 
Moil (1969) that the hyoid bone is lowered during the 
occlusion of English /bdg/. The authors interpret the 
finding to support the view that the larynx is drawn 
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downward in these stops and that# in view of the large 
masses involved, the depression must occur as a result of 
an active mechanism. More specifically, the authors sug= 
gest (p. 1554) that the hyoid bone is depressed by an ac- 
tivation of the extrinsic musculature of the larynx and 
that the depression usually co-occurs with pharyngeal ex^ 
pansion which, as was discussed above, is at least in part 
controlled by the constrictor muscles of the pharyngeal 
walls. Thus pharyngeal expansion and the enlargement 
caused by a depression of the larynx are perhaps not in^ 
dependently controllable gestures. From a functional point 
of view, at least, both serve the same purpose in this 
particular context . 

Further indirect evidence for larynx lowering in Eng- 
lish voiced stops is provided by Netsell (1965) who, on 
the ^ ":'ts of the essentially invariant subglottal pressure 
contour observed in the occlusion phase of /d/ suggests 
that the observed continuous transglottal pressure drop is 
the sole result of a volume increase in the supraglottal 
cavity. More directly, the larynx movement measurements 
using the thyrouinbrometer by Ewan and Krones (19 74) indi= 
cate, among other things ^ that voiceless stops (in English^ 
Thai, French and Hindi) generally have a higher larynx 
position than corresponding voiced stops, with this dif-- 
ference in position being greater at or near the end of 
the stops. The last observation, it may be pointed out, 
answers well the demand for a prolonged transglottal pres^ 
sure drop. 

The rema i n ing niechan ism is incomplete ve lopharyngeal 
closure, i*e, a leakage through the nasal cavities, men^ 
tioned by Rothenberg (1968i 99-106). The only information 
concerning the possible use of this mechanism in English 
available to the present writer is the observation of 
Kent and Moll (1969 t 1555) that one of their subjects gave 
evidence that velopharyngeal opening is more common during 
voiced than voiceless stops. 
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So far only machanisms enhancing the transglottal 
flow of air during the occlusion have been diseussed, no 
attention having been paid to the role of the voeal folds 
themselves. For example, Chomsky and Halle (1968: 301) 
have argued, on purely theoretioal grounds, that voicing 
in obstruents involves quite different adjustments of the 
vocal cords than does voicing in resonants. Referring to 
the theoretical investigations by Halle and Stevens {1967) 
they olaim that in obstruents periodic vocal cord vibra- 
tions can be maintained only if the open phase during each 
glottal vibration is lengthened over that normally found 
in vowe3G and that **the increased glottal opening would 
also help to maintain the vibration in the face of the 
redueed pressure drop across the glottis resulting from 
the buildup of pressure behind the consonantal constric- 
tion in the supraglottal cavity'' (Chomsky and Halle I968i 
301), As evidence they state ''certain well-known observa- 
tions", for example that the (transglottal?) air flow in 
voiced obstruents is noticeably faster than that in reso- 
nants . 

The above theoretieal argumentation, however, besides 
lacking experimental confirmation, is untenable also on 
theoretical grounds. How, in the presenoe of the aerody- 
namic constraints discussed above, could a larger open 
phase of the glottal vibrations and the resulting faster 
transglottal air flow help to prolong voicing during stop 
occlusion? one would rather wish to postulate a shorter 
open phase and slower transglottal flow to delay the 
equalization of the transglottal pressure drop. As for 
exparimental data Ladefoged (1973i 78-79), on the basis of 
his own quantitative data on the relation between sub- 
glottal pressure, airflow, and frequency of vibration of 
the vocal cords (Ladefoged 1967), claims that there is in 
fact a decrease in transglottal flow in voiced obstruents. 
Moreover, he claims that the lowering of pitch observed 
during and after voiced stops (see e.g. House and Fair- 
banks 1953) is sufficiently accounted for by this decrease 
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in transglottal flow, without a need to presuir^e any ad- 
justir.ents of the pQeitisns of the laryngeal structures. On 
the other hand, stavens (1977: 270-272) argues that larynx 
lowering and raising in voiced and voiceless stops, re- 
spective ly, are mere concomitants of the mote primary 
gestures of vocal cord slackening and stiffening. However, 
as adinitted fay Stevens (p. 278), the feature system con- 
taining, among other features, vocal cord slackening and 
stiffening is only speculative in nature. 

In sum, then, we conclude that, to the best of our 
knowledge, the mechanisms facilitating the prolongation of 
the transglottal pressure difference through an expansion 
of the oropharyngeal cavity seem to be the - both theoret- 
ically and experimentally ^ best motivated explanations 
for the occurrence of voicing during stop occlusion. In 
addition, adjustments of the vocal cords themselves may 
have an effect. Experimental evidence for such adjustments 
is not available, however, and theoretical considerations 
do not suggest them either, given the sufficiency of the 
aerodynamic measures, 

2.2. Voiceless Aspirated Stops 

Aspiration is the voiceless interval from the release 
of a voiceless stop to the onset of voicing during the 
following vowel or resonant (preaspiration being the in- 
terval from the offset of voicing in the preceding vowel or 
resonant to the implosion of a stop? this rather special 
case will not be discussed below). Acoustically, the re- 
lease of aspirated stops usually consists of a short period 
of strong turbulent noise similar to the homorganic frica-^ 
tive, followed by fhj -li)<e frictional noise of less inten- 
sity and containing spectral information about the transi- 
tion of the articulatory configuration appropriate for the 
occlusion toward that appropriate for the next articulatory 
target. Although it may be difficult, in an acoustic 
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analysis, to segment the whole of the release into these 
components they should, nevertheless, be regarded as 
separate phenomena on the basis of their (at least eemi-) 
independent eontrollability , The homorganic fricative 
noise is generated at the place of the occlusion and its 
strength and duration are, roughly, the function of the 
intraoral pressure at the moment of release and the rate 
at which the constrictory articiaators move apart after 
the release, respectively. The high^^r the intraoral pres- 
sure at the critical moment, the stronger the homorganic 
fricative noise, ceteris paribus. Similarly, the slower 
the movement apart of the constrictory articulators, the 
longer the duration of this noise- In affricates, of 
course, the f ricative-1 .ike constriction following the 
release is maintained much longer than in stops and may, 
indeed, be as long as the whole aBpiration of a heavily 
aspirated stop. The ^hj -like frictional noise following 
the explosive burst, on thu r^Uher hand, is generated at 
the qlottis and its duration is a function of the time 
taken for the glottis to return, from an open position at 
tho release, to the position appropriate for voicing (see 
below) . Thus, as the existence of affricates (or affri- 
cated stops) and aspirated stops indicates, the articula- 
tory mechanism offers possibilities for an independent 
control of the durations of the homorganic eMplosive noise 
nnd of aspiration. Yet, as wo shall see in discussing the 
fcinglish stops in more detail, the former can be highly 
dependtint on the mechanisms responsible for aspiration. 

Another question is whether, perceptually, it is the 
lack of voicing during the release or the presence of the 
concomitant noises that is the more important acoustic cue 
for the identification of an aspirated stop. The relative 
strengths of these cues can only be assessed through 
perceptual experiments involving synthetic or distorted 
speech: in norma' -inr-* :h production the cues cannot be 
independently cc:v iOd* 
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The physiQlogical rnechaniams underlying aspiration 
are well du amentedi aspiration is effected by a glottal 
abduction gesture, criuaing the glottis to be open gt the 
time of the release of the oral closure. Lisker et al. 
(1969) and Sawashima et al. (1970) have found out, using 
transilluJtiination of the larynx in running speech by a 
fiberoptlcs system, that the Qpenlng of the glottis char- 
acterising aspiration in English is effected by con= 
trolling the arytenoid cartilages which are responsible 
for the opening and eloeing movenients of the glottis* 
Similar oLservations about voiceless,, aspirated stops have 
been made in a number of languages, see e.g. Fr^kjaer- 
Jensen et £l, (1973) for Danish^ Sawaghima and Miyazaki 
(1973), Sawashima and Niimi (1974), Hirose and Ushljima 
(1978) for Japanesei Kim (1970), Kagaya (1971), Abberton 
(1972), Hirose et a^. (1974) for Korean; and Lindgvist 
(1972) and Lofqvist (1975) for Swedish, 

According to Kim (19 70) , the duration of aspiration 
is a function of the size of th^ alottal opening at the 
time of the release of the oral cioBuret the wider the 
glottis at the release ^ the longer the aspiration. Mora 
specif ieally, Kim's view (p. 112) is that the instructions 
to close the glottis (towards the position appropriate for 
voicing) after the release are simultanoouB (relative to 
the moment of release) for all voiceless aspirated stops 
regardless of the differences in the durations of their 
aspiration, these different duratione being the result of 
differences in the size of the glottal opening at the 
moment of release, Lisker and Abramson (e.g. 1971), on the 
other hand, maintain that the timing of the instructions 
to close the glottis is responsible for the differences 
in the duration of aspiration, emphasizing thus the Impor-^ 
tance of the timing difference as an independent physiolo- 
gical mechanism. 

Conf irmation of Lisker and Abramaon's view comes from, 
the electromyographic (EMG) studies of the intrinsic lar- 
yngeal muscles in voicing control conducted by Hirose and 
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Gay (1972} . They studied the EMG activity of a number of 
thase musQles in the speech of two American English 
speakers and observed that tha posterior cricoarytenoid 
(PCA) muscla, an abductor causing the arytenoid oartilageg 
(and consaquently tha vocal folds) to ba separatad, ac^ 
tively participates in laryngaal adjustments, particularly 
for the /ptk/^/bdg/ distinction (for similar results in 
Japanese see Hirose and Ushijima 1968) , Thera was a con- 
sistent increase in PCA activity for /ptk/ regardless of 
phoneme environment, and no such activity far /bdg/ . m 
sum, the authors state that their data "would suggest the 
ubiquit- of an indepandent timing control machanism" {p. 
161) , To this may be added that the existance of proaspi= 
rated stops also gives strong support for possibilities 
to control the relative timing of glottal and supraglottal 
articulations (preaspirated stops involving a glottal 
abduction gesture in advance of the beginning of the oral 
constriction) . 

As a result of the glottal abduction gesture vocal 
fold vibration is inhibited until the glottis again returns 
to the adductad position. The voicelGssness of the 
occlusion is thus extended over to the next segment. 

3. Physiological Correlates 

There is great variation in the acoustic manifesta- 
tions of the English /ptJ</-/bdg/ distinction in different 
environments. One aim of the physiological invostigations 
has been an attempt to find, at some level of speech pro- 
duction prior to the acoustic one, context -independent, 
invariant correlates of the linguistic opposition in 
question. The context-dependent acoustic output variations 
in the realizations of this distinction could then be 
eKplfllned to result from automatic, universal peripheral 
constraints, anatomical, inertial, aarodynamic or other- 
wise. The use of such pairs of terms as '•voiceless"- 
"voiced", "fortis"-' lenis% or "tense"-"lax" often implies 
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an acknowladgenent of the existence of such a level. The 
discovery of such a level must await future research^ in 
the meantime* we shall see that mome of the suggested 
terms seem, in the light of the experimental evidence 
available today, nore felicitous than some others. Nest, 
data on the most important physiolsgical parameters of the 
English /ptk/-bdg/ distinction will be surveyed, 

3.1, Subglottal Air Pressure 

The feature ^heightened subglottal pressure' was in- 
voked by Chomsky and Halle (19S8i 326) to explain the 
aspiration in / ptk/\ The relevant parts of their feature 
system have been extensively oriticised elsewhere (see e,g, 
Liske^r and Abramson 1971, Ladefoged 1973); here it is 
sucfioient to point out that experimental investigations 
have failed to assess significant differences in sub- 
glottal pressure in the production of /ptk/ and /bdg/ in 
English. Thus Netsell 11969), for example, using three 
speakers of Anierican English as subjects, made simulta- 
neous recordings of subglottal and supraglottal air pres- 
sures and came to the conclusion that the tespli_itory 
system generates an essentially invariant driving pressure 
with regard to the /ptk/-/bdg/ distinction. Similar re- 
sults have been obtained by Ladefoged (1967: 41-44), 
Lieberman (1967; 97) and Shipp (1973). Thus we can con- 
elude that the lungs do not actively participate in the 
d i g t inc t ion . 

3,2, Supraglottal (intraoral) Air Pressure 

Peak intraoral air pressure measurements generally 
show higher values for members of the set /ptk/ than for 
/bdg/ (Malecot 1966a, Arkebauer et al, 1967 ; Netsell 1969; 
Lubker and Parris 1970 1 Tatham and Morton 19731 Warren 
and Hall 1973; Shipp 1973) - However, the peak pressures 
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teached in the two aats arii often similar, and what 
really dlitlngulshes them is that in /ptk/ the intraoral 
pressure reaches its maximum {1... the level of the sub- 
glottal pressure) almost immediately after the beginning 
of the occlusion, while in /bdg/ there ia a gradual in-' 
crease with the maximuin pressure occurring at the end of 
the occlusion and often not reaching the level of the sub- 
glottal pressure (see especially Lisker 1970 and Shipp 
1973) . The explanation for this difference Is in the 
activity of the glottis; in /ptk/ the glottis, as a result 
of the abduction gesture, is wide open soon after the 
Implosion, enabling the almost instantaneous equalization 
of the suhglottal and intraoral pressures, whereas in 
/bdg/ the glottis acts as a kind of valvi, letting only 
small quantities of air at a time to escape through the 
larynx. Moreover, the task of the active mechanisms of 
oropharyngeal cavity expansion discussed above is to delay 
the equalization of the transglottal pressure drop. 

Thus, although a re liable lnd#.s of the /ptk/-/bdg/ 
distinction, the intraoral air pressure is not an inde- 
pendent parameter nor does it reflect differences in sub- 
glottal air pressure. Differences in intraoral air pres- 
sure are simply caused by differences in the valvular ac- 
tivity of the glottis, they are c-ncomitanta of glottal 
adjuatnients effected by the intrinsic laryngeal muscles, 

3.3. Muscular Tension 

MalScot (19G6b) studied the mechanical pressures 
exerted during stop occlusion in American English. Accord- 
ing to his measurementa there are no significant differ- 
ences between the two sets of stops in this respect - Kent 
and Moll (1969: 15S5) conclude that homorganic stop and 
nasal consonants are produced with a common gesture of the 
constrlctory articulator. Similarly, according to Luhker 
and Parrls (1970. 625) who measured intraoral air pressure. 
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force oi labial contact and labial EiMG activity in /p/ 
and /b/ , the labial gesture for the two stops is essen- 
tially monotypic, roquiting no more forceful labial con- 
tact for one than for the other. Further EMG studies by 
Harris et al . Ui65), Promkin (1966) and Tatham and Morton 
(1969) indicate no consi^tant difference in EMG peak ac- 
tivity between /p/ and /b/ . The simultaneous recordings 
of upper lip, lower lip and jaw movements concomitant 
with Intramuscular EMG during /p/ , /b/ and /m/ by Sussman 
SJ: a^. (1973) indicate that^ depending on context and the 
particular articulator displacement or velocity parameter 
or muscle activity investigated, any of the three conso- 
nant b can Involve the greatest displacement, the greatest 
velocity or have the highest level of EMG activity. Thus, 
although in the preparatory activity in anticipation of 
the stop closure /p/ had the fastest jaw elevation speed, 
the greatest amount of upper lip lowering and the fastest 
upper lip lowering as a result of complementary contribu= 
tion of aerodynamic and neuromuscular forces* /p/ showed, 
on the explosion side of the stop, **the slowest mean lower 
lip depression speeds, the slowest jaw depression speeds 
across all vowel contexts, the slowest net jaw + lower 
lip depression velocity, and the smallest jaw and lower 
lip displacement following the release. The voiced stop 
/b/ was faster than /p/, but slower than /m/ in all these 
measures. However, the upper lip does not follow this 
hierarchy and, in fact, is opposite, with the net effect 
being the cancellation or minimiiation of any differences 
between the postocclusion maneuvres for the three stop 
cognates" (p. 416), However, the authors state (p. 415) 
that "where meaningful phonetic comparisons could be made, 
we often found small differences between the three stops /p/, 
/b/, and /m/ in terms of these three form of data artic- 
ulatory displacement, articulator velocity, and averaged 
EMG activity, K.s.J , We have made some statistical ana- 
lyses of these differences, and though many of the differ- 
ences are not statistically significant, the trends are 
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often consistent across the three dependent variables and 
subjeGts'S and further (p, 402) that *^we found that in 
terms of which muscles were operating and the approximate 
timing of their operation, the three bilabial consonants 
were basieally equivalent". 

We have already touched upon the features 'tense' and 
'lax' in GOnneGtion with mechanisms of oropharyngeal cavi- 
ty expansion, if 'tensenags' is interpreted to involve 
activity of the pharyngeal musculature we should rather 
have to label /bdg/ 'tense' as they and not /ptk/ were 
seen to involve such activity. In fact MalScot (1970) 
claims that the /ptk/Vbdg/ (or * f ortis ' - ' lenis * ) distinc- 
tion has litcle or nothing to do with articulatory energy 
but is a synesthetic interpretation by native speakers of 
the intraoral air pressure differences accompanying these 
two sets. 

On the basis of the above discussion we venture the 
following conclusion: due to the variable aerodynamic 
conditions prevailing in the mouth as a result of differ- 
ent glottal adjustments in /ptk/ as against /bdg/ there 
are some minor differences between the two sets in the 
organization of the supraglottal constrictory articula- 
tionsi but by and large the supraglottal constrictory 
gestures are similar for the cognate pairs and the ex- 
isting evidence overwhelmingly points towards a different 
source for the /ptk/-/bdg/ distinction, the features 
' tense '-- 'lax ' thus lacking experimental confirmation as 
chaeacterizerE of the English stops. 



2.4, Laryngeal Mechanisms 



The aspiration characterizing the English /ptk/ is 
effected by a glottal abduction gesture (for data eee 
section 2.2, above), causing an active devoieing of the 
occlusion and part of the following vowel. The maintenance 
of a sufficient pressure drop across the glottis requisite 
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for sustained phonation in /bdg/, on the other hand,- im 
effected by an enlargement of the oropharyngeal cavity 
(for data see section 2,1.. above). The oropharyngeal 
cavity expansion takes place as a result of pharyngeal 
cavity expansion or larynx lowering ^ or a combination of 
these. The interrelations between the muscles controlling 
theie two types of eKpanslon are not well knowni Kent and 
Moll {1969 r 1554) suggest that the hyoid bone depression 
and the co-occurring larynx lowering observed in /bdg/ 
are effected by an activation of the extrinsic musculature 
of the larynx and that this depression usually co-occurs 
with pharyngeal e>cpansicn Which is caused, at least in 
part, by the constrictor muscles of the pharyngeal walls 
{Bell-Berti 1975) , 

In sum, then, we can conclude that the mechanism 
responsible for aspiration and for an active devolcing 
of the occlusion in /ptk/ is located in the glottis, in 
the activity of the intrinsic laryngeal musculature. 
The oropharyngeal cavity expansion operating in /bdg/ is 
controlled by the extrinsic laryngeal muscles or by the 
muscles of the pharyngeal walls, or both. 

3,5, Durational Differences in the Supraglottal 
Articulations 



In this section parameters of two types will be 
discussed, viz, the duration of the occlusion in the two 
sets of stops and the influence of the /ptk/^bdg/ dis- 
tinction on the duration of the adjacent vowels. 

On the whole, rather little= infontiation is available 
about the relevant durational patterns, especially with 
regard to the duration of the occlusion in the non-medial 
positions and with regard to the effect on the adjacent 
segments in the word Initial and medial positions. More-^ 
over, the data obtained for these positions are often 
conflicting* As for the word initial position, Lisker {1972) 
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concludes that Ht appears that a differonce in closure 
duration is far from being a regular feature of the 
contrast betv/een English /ptk/ and /bdg/" (p. 342) . how-^ 
ever, this conclusion is based on the measurement of the 
productions of a single speaker only. Mutanen (i973i 41) 
reports an equal duration of the occlusion for the labial 
cognates but shorter durations for /tk/ than for /dg/ . 

In an assessment of the possible effect of the word 
initial /ptk/-/bdg/ distinction on the duration of the 
following vowel the placement of the boundary between the 
stop and the vowel is of crucial importance since the 
inclusion of the release phenomena in either the stop or 
the vowel makes a big difference. Thus Peterson and Le- 
histe (I960), for example, conclude that the average 
duration of a syllable nucleus was shorter after a voice- 
less consonant than after a voiced consonant if aspiration 
was excluded from this measure, if, on the other hand, 
aspiration was considered part of the syllable nucleus 
the latter was longer after a voiceless consonant. In 
conclusion, the authors state that the influence of an 
initial consonant on the duration of the (following) 
syllable nucleus followed no simple regular pattern. Al- 
though the segmentation conventions followed in Suomi 
(1976) were somewhat inconsistent in this respect the same 
conclusions can, after some computation, be made on the 
basis of the results obtained (pp, 20-22). That is, dis- 
regarding differences due to /place of production/, vowels 
following /ptk/ were longer if aspiration was considered 
part of the vowel and, if aspiration was eKcluded , no 
consistent differences between the two sets could be ob^ 
served. 

The rnsulti obtained for the medial position are 
Similarly controversial. Thus Giitison (1970.- 154), for 
example, on thm basis of an sxplioit reference to Lisksr-s 
(19S7) maasuremento, maintains that the closure duration 
is longer medially in /ptk/ than m /bdg/ . However, a 
Closer eKamination of Lisker's paper reveals that the 
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closure duration of the labial cognate pair was found to 
be different only when preceded by a stressed syllabic 
and followed by an unstressed one^ there being no signif- 
icant differences under the other conditions. Similar 
results were later obtained by Lisker (1972) on the basis 
of intraoral pressure recordings although « again , in the 
speech of a single infonnant^ 

Kent and Moll (1969) found no differences in the 
durations of the articulatory closures of the cognate 
pairs in a VCV frame where equal stress was to be placed 
on both of the syllables, Lubker and Parris (1970) on the 
other hand, found out that/ depending on the placeinent of 
stress in the utterance, either /p/ or /b/ had a longer 
duration of labial contact. The differences between the 
cognates In each particular utterance type were, however ^ 
at most 12 msec. Thus/ while the duration of the occlusion 
was greatly affected by variations in the placenient of 
stress^ both cognates seemed^ by and large, to follow 
similar patterns. Similarly, the results obtained in Suomi 
(1976 1 28-37) indicate no consistent differences in the 
duration of the occlusion (as assessed by acoustic meas"- 
urements) between cognate pairs (that iS| although sig- 
nificant differences were obtained between /p/, /k/ and 
/h/ f /g/ , respectively I the differences in mean durations 
were negligible and there was ^ moreover, an opposite trend 
with regard to /t/ and /d/) , 

The conclusion to be drawn from the above discussion 
is^ then I that although the duration of the occlusion of 
word medial stops is greatly influenced by e,g. stress 
the two sets are similarly affected. The majority of the 
available data suggests that the intere agnate differences 
are at most unsystematic and small . This suggests that the 
duration of the occlusion of word medial stops cannot be 
effectively used as a cue for the /ptk/-/bdg/ distinction 
although investigations examining these sounds in as many 
different environments as possible are clearly needed. 
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Experimental data on the effect of the word medial 
/ptk/-/bdg/ distinction on the duration of the preceding 
vowel are hard to find. A careful inspeotion of the 
"classical" literature on vowel duration revealed that the 
vowels examined preceded word final consonants in mono- or 
disyllabic words (e.g. Belaeco 1953, House and Fairbanks 
1953, Zimmerman and Sapon 195«, Peterson and Lehiste 1960, 
House 1961) and the data, therefore, are not relevant here. 
The only data from this period that are directly relevant 
to the present discussion are those by Sharf (1962) who 
has measured the duration of vowels preceding word medial 
consonants. According to Sharf (p, 29) the duration ratio 
of vowels preceding word medial /p/ and /b/ is about 3:4 
and that of vowels preceding /k/ and /g/ about 4i5 (the 
single speaker used as an informant used an alveolar flap 
for both /t/ and /d/ and, consequently, no corresponding 
differences in these stops could be detected). However, 
the results obtained must be viewed with some scepticism. 
The measurements are based on a single speakeri the number 
of occurrencies of the different structures is not given; 
the segmentation conventions are not stated; and the words 
were recorded directly on to the tape-loop of a spectro- 
graph (with little possibilities of checking the natural- 
ness of the recordings afterwards). Wiik's (1965: 114-117) 
data refer to vowels before word final consonants in 
monosyllables. Small but unsystematic differences in vowel 
duration before word medial /ptk/ and /bdg/, respectively, 
were obtained in Suomi (I976i 28-37) on the basis of 
acoustic measurements. The results indicate a slight re- 
ciprocal tendency between the durations of the occlusion 
of the medial stops and the durations of the preceding 
vowels: in the case of labial and velar stops the /voice- ' 
less/ cognate exhibited a longer closure duration and a 
shorter duration of the preceding vowel but in the alve- 
olar stops there was no difference in occlusion duration 
and the preceding vowel showed an inverted pattern compared 
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to the other places of production. On the whole/ the 
obtained differences were negligible and could be the 
result of a eystematic error in segmentation in the two 
sets of stops. Scully {1974} has pointed out that the 
duration of an articulatory occlusion cannot be directly 
inferred from the duration of the resultant acoustic 
occlusive segment, and this may have biassed the results 
somewhat . 

As for the influence of the word medial /ptk/-/bdg/ 
distinction on the duration of the following vowel, no 
data whatsoever could be found. 

We must draw the conclusion, then^ that durational 
differences in the supraglottal articulations do not seem 
to differentiate word medial, intervocalic /ptk/ from 
/bdg/ . 

Again it was difficult to find data on the duration 
of the occlusion of word final /ptk/ and /bdg/. Suen and 
Beddoes (1974) obtained longer closure durations for /ptk/ 
than for /bdg/, the average difference between the two 
sets being of the order of 33 msec. The measurements of 
Suoml (1976 ^ 48-53} indicate a statistically strongly 
signrificant difference between the two sets to the effect 
that /ptk/ had a longer closure duration^ the differences 
between the cognate pairs being 20 to 30 msec. Similar 
differences between word final /p/ and /b/ were obtained 
by Lubker and Parris {1970 i 631) on the basis of intraoral 
pressure recordings, both after stressed and unstressed 
vowels. The limited but parallel information available 
suggests, then, that in this position ^ in contradistinc^ 
tion to the others, a regular difference in the duration 
of the occlusion accompanies the /ptk/-/bdg/ distinction. 

That vowels and resonants have a longer duration 
before word final /voiced/ obstruents than before /voice-- 
less/ ones in English is such an established piece of 
knowledge that there is no need to repeat the mass of ref^ 
erences to published data here. Similar observations have 
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been made with regard to a number of languages * and it is 
possible that part of the durational variation is con- 
trolled by some kind of a universal articulatary con- 
straint (see, for eKainpiei Zimmerman and Sapon (1958) for 
Spanishi Fintoft (1962) for No^egian, Wiik (1965) for 
Pinnieh (in Wiik's /voiced/ category only one obstruent 
is includedi viz. /d/) , and Chen (1970) for Frenchi Ruaalan 
and Korean) . The prolengation of voweli and reeonants 
before a word final obstruent isi however , much more 
drastic in English than in the other languages i and no 
doubt the extra lengthening is a language speeific feature 
af inglish (see especially Chen 1970) . 

Chomsky and Halle (1968? 301) have argued that the 
lengthening before /voiced/ obstruents is caused by the 
extra time needed by the glottis to shift from the con- 
figuration appropriate for vowels to that appropriate for 
obstruents. However, no evidence for such glottal readjust- 
ments for /bdg/ is available. On the contrary i the results 
obtained by Raphael (1975) seem to indicate that the dura- 
tional variations are primarily controlled physiologically 
by motor commands to the muscles governing the articulators 
that are active in the formation of vowels. From a purely 
functional perspective it could be argued that, given the 
often voiceless occlusion of word final /bdg/, the dura- 
tion of the preceding vowel has assumed (part of) the 
function to signal the word final consonant distinction. 
This contention is supported for example by Raphael's 
conclusion (1972), based on a perceptual investigation, 
that the duration of the preceding vowel is a sufficient 
(and for the types of stimuli used in his experiments, a 
necessary) cue to the perception of the /voiceless/- 
/voiced/ character of a word final stop, fricative or 
consonant cluster. Raphael further concludes that the 
presence of voicing during the closure period does have 
some cue value, although it is minor compared to that of 
the preceding vowel duration. 
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4, A summary of the Phonetic Correlates 

In this section an attempt will be made to relate to 
each other the various phonetic parainetera discussed 
above. This will take place in terms of the three poai" 
tions already referred to, viz. the word initial, medial 
and final positions^ and the references to relevant data 
can be found in the appropriate places above. 

KegardlefS of position, subglottal air pressure does 
not vary as a function of the /ptk/-/bdg/ distinction. In 
a similar way, the available data do not Indicate, in any 
meaningful way, systematic differences in muscular tension 
or in the muscular energy expended. How, for example, 
could we compare the energies needed to abduct the glottis 
in /ptk/ on the one hand and to enlarge the oropharyngeal 
cavity in /bdg/ on the other hand? 

Let us discuss the word initial and medial positions 
first. In /ptk/ there is a glottal abduction gesture 
causing the vocal cords to be wide apart at and after the 
release of the oral constriction. This physiological 
mechanism has several concomitant phonetic consequences. 
First of all, it causes a virtual equalisation of sub-^ 
glottal and intraoral air pressures at an early stage of 
the occlusion, evidenced by the high intraoral pressure 
values reported for these stops. Immediately after the 
release, the glottis is still open, allowing air of high 
pressure to escape not only from the oropharyngeal cavity 
between the glottis and the constriction but from the sub- 
glottal' cavities of much larger volume as well. This ex- 
plains the strong explosive burst, with homorganie friction, 
characterising these stops. At the same time, the return 
of the glottis from the open position to that appropriate 
for voicing causes a delay in the onset of voicing for 
the following vowel or resonant. The duration of the 
resultant aspiration depends on the time taken by the 
glottis to perform this maneuver. 
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It ia a common observation that th# duration of 
aBpiratlon varies also as a function of the place of pro- 
duction of the atops lo that, usually, labial stops have 
the shortest and alveolar or velar stops the longest 
aspiration. It would seem counter-intuitive to assume 
that these differences originate in the glottis itself, ' 
and in fact there are indications that the differences 
are the result of an interplay between invariant glottal 
activity and variable supraglottal articulations. In Suomi 
(19?6i 37-43) it was observed, for example, that there 
was a compensatory relationship between the durations of 
the occlusion and the duration of aspiration in word 
medial /ptk/. These results (based on a mingographic 
analysis) were interpreted (pp. 62-64) to show that by 
postulating a unimodal glottal abduction gesture for the 
whole set the observed differences in the duration of 
aspiration could be eKplained by the differences in the 
durations of the supraglottal occlusions alone, the total 
durations (occlusion + aspiration) being the same for each 
place of production. More direot information of a similar 
pattern was obtained and similar conclusions were drawn 
for Swedish aspirated stops under various conditions by 
LttfqVist (1976) . 

In the two positions mentioned above /bdg/ are 
characterized by a glottal configuration similar to that 
found during vowels and resonants, i.e. the vocal corda 
are in the position appropriate for voicing. The aero- 
dynamic consequence is that, whether there is actual vocal 
cord vibration or not, the passage across the glottis is 
relatively obstructed, preventing the instantaneous 
equalization of subglottal and eupraglottal (intraoral) 
air pressures during the occlusion found in stops with a 
glottal abduction gesture. Whether, at the end of the 
occlusion, the intraoral pressure reaches the level of the 
subglottal one or not (and it must not, if voicing is to 
be maintained throughout the occlusion) , the glottis is 
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still ralatively narrow at the moment of the release of 
the oral constriction, holding baeic the air contained in 
the lungs, and the explosion burst is necessarily weaker 
in these stops than in aspirated ones, ceteris paribus . 
At the same time, the state of the glottis at the release 
enables an immediate application of the Bernoulli effect, 
without any appreciable lag in the onset of voicing for 
the following voiced sound or, indeed, without any break 
in the glottal vibrations in the ease of fully voiced 
stops , 

The extent of voicing during the occlusion depends 
mainly on the effectiveness of the use of the mechanisms 
facilitating the transglottal flow of air through an ex- 
pansion of the oropharyngeal cavity at the expense of the 
subglottal ones {which, in this context, can be regarded 
as a single cavity) . it seems that in English, as already 
discussed, relatively little use is made of these mecha- 
nisms in the word initial and final positions, at least 
in citation fornis. in word medial /bdg/ the occlusion 
is usually fully voiced in all kinds of speech. Now there 
are at least two possible explanations for this dis- 
crepancy with regard to voicing in the three positions. 
On the one hand. It could be the case that because in 
isolated words (citation forms) higher-order grammatical 
boundaries cQ^-occur with word boundaries the frequent 
complete voicelessness of the occlusion of /bdg/ in these 
positions could be caused by allophonic devolcing rules 
sensitive to such grammatical factors. The occurrence of 
full voicing in word medial /bdg/ could then be explained 
as a result of the lack of such bQundarieB in the neigh- 
bourhood. On the other hand, it could be argued that the 
weakly voiced word initial and final /bdg/ represent more 
closely the canonical or target forms of these stops and 
that the full voicing of /bdg/ word medially is caused 
by the proximity of naturally voiced sounds (ideally when 
the stop is between vowels) . In this case we would be 
dealing with tan at least theoretically simple case, of) 
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partial assimilation. This assimilatory force eould also 
conceivably bm used as an explanation for the frequent 
occurrence of completely voiced oeolusions in word initial 
and final /bdg/ in rapid connected speech. The crucial 
point, then, is whether the English /bdg/ are, in com- 
parison to the equivalent stops of, say, the Romance or 
Slavic languages, inherently less voiced or whether the 
various grammatical junctures have a stronger tendency to 
devoice /bdg/ in Ing llsh. Considerations of the phonetic 
manifestations of both sets /ptk/ and /bdg/ in the 
languages in question, and especially their distribution 
on the voicing continuum, would seem to favour the solu- 
tion postulating differences in the inherent degree of 
voicing, in the canonical or target values of these stops. 
However, without extensive cross-language studies we are 
in no position to answer this question. 

It seems, then, that in the word initial and medial 
positions the phonetic differences accompanying the /ptk/-> 
/bdg/ distinction can, by and large, be accounted for by 
the differences in the glottal behaviour alone. 

The word final position is unique for at least two 
reasons* Firstly, the glottal mechanisms responsible for 
differentiating the two sets in the other positions cannot 
be effectively used since, in utterance final position at 
least, the timing of the voice onset for the following 
vowel or resonant is necessarily inapplieable . One of the 
acoustic cQncomitants of the difference in glottal be- 
haviour, the strength of the explosive burst can, admitted- 
ly, operate also here, but the use of this cue is somewhat 
marginal since the English stops are often not released 
audibly in this position. To this may be added that the 
extent of voicing of the occlusion is also a weak cue as 
even the occlusion of /bdg/ is often completely voiceless. 
Secondly, the final position seems to be the only one in 
which large and systematic differences in the supraglottal 
articuiations accompany the distinction, vowels having a 
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longer duration before /bdg/ and the closure duration 
being longer in /ptk/. It is these durational differences 
that are the strongest acoustic exponents of the distinc- 
tion in this position. Thus we cannot generalize by stating 
that ultimately all the acoustic manifestations of the 
/ptK/-/bdg/ distinction in the three major positions can 
be traced back to a rather simple difference in glottal 
behaviour in th^ two sets since the differences in the 
supraglottal constrictory articulations in the word final 
position cannot conceivably be attributed to direct glottal 
control. Yet it is part of the native speakers* intuition 
that /bdg/ remain "the same" and distinct from /ptk/, 
irrespective of their position in the word or larger 
entities. Similarly, it Is part of the task of linguists 
to try and capture this intuition in their descriptions. 
NeKt, an attempt will be made to define the common denom- 
inator uniting the phonetic manifestations of the /ptk/- 
/bdg/ distinction in English in the three positions* 

5. An Interpretation 

Let us assume that., at some unspecified level of the 
phonQlogy of English, the decisive difference between 
/ptk/ and /bdg/ consists of the former being associated 
with less voicing or glottal vibrations than the latter, 
either before, during or after the supraglottal con^ 
atrietory articulation. The phonetic manifestations of 
this difference can be described in terms of such para- 
meters as voice offset time and voice onset time relative 
to the occlusion, with their concomitant differences in 
the eKtent of voicing during the occlusion, the strength 
of the explosive burst, the duration of the aspiration 
etc. Without repeating ourselves too much we can maintain 
that, ceteris paribus , word initial and medial /bdg/ are 
always characteriied by more glottal vibrations in the 
neighbourhood of the occlusion than /ptk/, no matter what 
other differences may be detaeted. More specifically, it 
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Lb the relative amount of voicing during and after the 
otelusion that is important in these poiitlons. 

In the word final stops, however, the same parameters 
cannot be effectively used because the occlusion is (often) 
voiceless in both sets and the acouitic manifestations of 
the differences in glottal behaviour (strength of the 
burst, aspiration) can be of marginal Importance, The 
prolongation of the vowel before /bdg/ and the lengthening 
of the occlusion in /ptk/ can now be interpreted to be a 
means of securing the conformity of the word final position 
to the general principle, the former actually increasing 
the number of glottal vibrations and the latter acting as 
a reinforcement through a change in the vowel/consonant 
ratio. Thus, given the inapplicability of the more usual 
cues and the resultant danger of a neutralisation of a 
linguistic opposition, the two measures can be claimed to 
form, although on a rather abstract level, a conspiracy 
(Kisseberth 1970) with the purpose of preserving the 
opposition. Despite the terminology used it is not here 
claimed that a speaker of English acts according to the 
above considerations? rather,- the discussion is meant to 
provide some possible reasons for the historical sound 
changes that have lead to the present situation. In a 
formal synchronic description it could be postulated that 
the rules reBponsible for decreasing the differences 
between the two sets (e.g. the rule devoicing the occlusion 
in /bdg/) precede the vowel and consonant lengthening 
rules, creating a situation where a neutralization might 
arise although later rules (e.g. allegro rules, see 
Dressier 1975) may again voice /bdg/ and, more generally, 
make the application of the usual glottal mechanisms pos- 
sible ^ 

If the somewhat abstract argumentation above is 
accepted, the following generalization can be statedi in 
English /ptk/ are, ceteris paribus, associated with fewer 
glottal pulses than /bdg/, counted forward (or ''left-to-- 
right") from the beginning of the occlusion if a syllable 
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follows in the same word, and backwards ("right-to-left") 
if not. This algorithm is* of course, directly applicable 
on the ti^juuatic lovul, and Uiu uoncoinitcjnt differences 
can be disrogarded fin the rarely occurring situation 
where we have to decide which of two otherwise identical 
utterances eontains which cogntite) . 

6. Terminological ConsiderationB i iiow to Label thg 
Distinction? 

From a tmrely fornial point of view the choice of a 
name for the classlf icatory feature distinguishing, inter 
aUa, /ptk/ from /bdg/ im a matter of little importance 
as long as the feature is conBistently used. If, on the 
other hand, it is conBidorod desirable {as i doJ that our 
phonological descriptions be as firmly anchored in the 
phonetic foundations as possible, it could be maintained 
that the labels of the dietinctive features should, as far 
as possible, reflect the ^real processes responsible for 
the conveyance of the distinction in speech. The obser- 
vation of the substantive content of distinctive features 
iB also, as far as I eee , one of the cornerstones of the 
theory of m.arkedness {in the sense of e,g. Chomsky and 
Halle 1968) , Moreover , cross-slanguage comparisons (histor- 
ical, typological, pedagogical etc) presuppose an ob- 
jective frame of reference, which is easily obscured if 
too abstract labels are ineptly used. It may be impossible 
to find a perfectly telling and exhaustive label for a 
given distinction but there is, I think, a difference 
between Glearly counterf actual labels and ones that capture 
some aspoct(s) of the phonetic exponents of the distinc= 
tion. Thus, there seems to be some objective motivation 
for an evaluation of the appropriateness of particular 
labels. If the demands suggested above for distinctive 
features are accepted, it follows that a certain group of 
labels must be rejected, namely those that are false, too 
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abstraet or too vaguely defined to hm subjected to em- 
pirical verification. 

The labels suggested for the English /ptK/-/bdg/ 
distinction include "ten@e"-*'iax" , "fQrtis"-"lenls" , 
"aspirated"-^"unaspirated% "t heightened subglottal pres- 
sure", "voiceless"-"volced", and their various para- 
phrases. Let us consider each of these dichotomies in . 
termE of the appropriateness defined above. 

The data surveyed in section 3.3. above indicate 
that the labels "tenae"-^"laK" are inappropriate because 
they lack experimental validation^ the various niore or 
less heuristically selected parameters, intended to cap- 
ture the correlates of this distinction, were seen to 
exhibit no systematic differences between the two sets. 
The Chomsky and Halle (1968) feature "+ heightened sub- 
glottal pressure'' was seen to be proven counterf actual 
Ciee section 3.1.) and must be rejected. The terms 
"fortis''-"lenis", which are defined by Gimson (1970: 151) 
as involving different amounts of muscular energy and 
different degrees of breath effort, are likewlae not sup- 
ported by the experimental data available. If ''muscular 
erj^ergy" here refers to the total energy expended in the 
produetion of the two sets of stops the claim must be 
eonsidered. premature since, at present, it Is not possible 
to measure that energy. "Breath effort" can have two 
interpretations: if it refers to differences in subglottal 
air pressure the claim is counterf actual i and if it refers 
to differences in intraoral air pressure the term is 
superfluous since the differences are not an independent 
variable but an automatic consequence of other, mainly 
glottal, adjustments. The terms *'asplrated"-''unaspirated", 
although consistent with the facts in certain positions, 
must be considered inadequate because of their inability 
to characterize the two sets in all positions. 

The remaining terms, "voiceless "voiced" , are often 
criticized on the grounds that the absence vs. presence 
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of glottal vibrations during the occlusiQn characterises 
the dietlnation only in certain environments (e*g. word 
medial ly between vowels) while in the word initial posi- 
tion, for example, the diatinction is signalled by the 
preeehce vs. absence of aspiration, while both sets are 
m fact voiceless, etc. However, leaving the word final 
position aside for a moment, it can be claimed that this 
kind of argumentation is basically the result of viewing 
speech as a sequence of static segments segments that 
either poesess or do not possess a certain property, e.g. 
voice. On the preeeding pages, however, I have attempted 
to show that a single mechanism, the timing of the offset 
and onset of glottal vibrations relative to the supra- 
glottal constrictory articulations is responsible for the 
seeiningly disconnected peripheral acoustie variation ac- 
companying the /ptk/-^/bdg/ distinction (this is, of course, 
what Abramson and Lisker have been maintaining for a 
long time in their writings) . Due to the nature of things* 
the voicing continuum, ranging from full voicing to 
maximal voicelessness in the form of extensive aspiration, 
involves qualitatively discontinuous changes in th^j 
acoustic output. Thus, given the subglottal air pressure 
and the supraglottal articulations as constants, the timing 
of the offset and onset of voicing (by the proper use of 
oropharyngeal cavity ej<panaion or the glottal abduction 
gesture) relative to the supraglottal articulations will 
create most of the peripheral acoustic variations quite 
automatically. Prom the point of view of the acoustic ' 
output the crucial question is whether or not the vocal 
folds are ready to start vibrating immediately after the 
release. Thus, a consideration of the voice controlling 
mechanism together with the introduction of the time 
dimension into the description brings order into an appar- 
ent chaos. 

Another argument sometimes put forth against differ- 
ences in voicing as a systematic index of the /ptk/-/bdg/ 
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dlBtinction is the fast that in whispered speach all 
segroante are voiealees and yet the distinction can be 
maintained. However, this is a special case comparable, 
for example, to the neutralisation of tonal distinctions 
in singing, and it cannot be used to falsify claims about 
the nature of normal, phonated speech. Anyway, it seems 
likely that the noise concomitants of the glottal adjust-- 
ments take on a special importance in whispered speech, 
/ptk/ being differentiated from /bdg/ on the basis of the 
stronger noises following the release in the word initial 
and medial positions, while the word final distinction is 
perhaps adequately manifested by the differences in the 
duration of the (voiceless) vowel preceding the silent 
occlusion, with the possible addition of noise differenceB 
if the stops are released in this positions. 

It was seen above that in acoustic terms, too, the 
labels "voicelesa" and "voiced" are to be regarded as the 
most appropriate ones among those mentioned since the 
number of glottal vibrations connected with /ptk/ and 
/bdg/ is the most objective and reliable single acoustic 
index of the distinction. What the terms cannot directly 
cover is the word final position, but if we accept the 
(admittedly somewhat stretched) definition presented at 
the end of section 5. to include this special case we have 
good grounds, I think, to designate /ptk/ and /bdg/ as 
/voiceless/ and /voiced/, respectively, until a more appro- 
priate pair of labels is found. I want to finish this 
discussion by quoting Henderson who, in discussing the 
related issue of 'register' languages, says that "there 
is obviously great attraction in the theory of a single 
feature that could plausibly account for such diversity 
of associated phenomena, but in my view it is too early 
to think in terms of 'phonological features'. The last 
thing we need at the moment is a 'cover feature' to mask 
our ignorance of the physical parameters involved. As 
Hend©rson himself saysi 'from a phonemic point of view, any 
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set of tarmi may serve to designate that A ^ B.* What is 
needed first Lb a thorough investigation - articulatory , 
aeoustio and perceptual - into what is actually happening 
Only when we are reasonably clear about that can we use- 
fully set up 'phonological features' of some kind." (Hen- 
derson I977i 258--2591 , 
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THE VOICELESS = VOICED OPPOSITION OF ENGLISH CnflSONANTS ^ DIFFICULTIES OF 
PRONUNCIATION AND PERCEPTION IN COMMUNICATION BE' EEN NATIVE AND FINNISH 
SPEAKERS 

Risto Hanninen 

1, INTRODUCTION 

1 J. Aspects of thi English and Finnish Consonant Systems 

The English consonants where sound pairs are distinguished by the so 
called voiceless voiced distinction, can be described as follows: 
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The affrlcati^ /tr/ — /dr/ are bracketed becausi they are not unanimously 
considered as affricates, but as sequences of tv/0 sounds. However ^ Gimson 
(1956:168), for example, regards these as complex sound entities called 
affricates, 

English has complete vciceleM ^ voiced, or ^o^tLh tmLb pairs 
in the sound categories stop, fricative and affricate. Finnish, on the 
other hand, lacks such oalrs almost completely. Only with the dental and 
alveolar stops could the other member of the pair be Included in the 
paradigm, but even here only in brackets, because it can be maintained 
that the distinction between Finnish /t/ and /d/ is primarily a distinction 
of the place of articulation^ and only secondarily of the presence or 
absence of voice. As for the bilabial and velar stops, Finnish has only 
/p/ and /k/, and lacks the voiced, or lenis, counterparts of the pairs. 
With fricatives, Finnish has only the alveolar /s/; the Finnish /v/ Is 
regarded as a semivowel, not a fricative. English can present four 
fortis — lenis pairs of fricatives at different places of articulation. 



3t .ffricjts jounds. -hi: ic.ri catsnory jo^; -.oz ...5. F.nnnh 

at al 1 . 

Ths di3;r-fcut:sn of Ln;;ii5n conionartj 1; ^,7 '"ree: HI th^ 
consonints mentioned jbove «n -cjur in v.o,'d init^,], .,nr,i medial ana wd,-d 
final Doiitioni. On cnj otne-- hand, tns i,stribu-igr of the Finnish ' 
Consnnant phonmss is jery restHctiJ: /d/, trie cnly ;.n.;,nMR ^h^t c?n hrt 
diitlnguished from its counttrnart Ihrough voice, is nstridted to v,c-d 
msrtial position; /p. and ;a,-no; cccyv in w^rd fin^l MSition; ar.a thors 
Jre itrong re5tr,ction5 in cons^nan- clusters with r.gjrd to whic'^ Mnsonants 
are allov.id to fallow eacn a-i-e-, 

"rnn ths atove it c.n bg ccnduded that the English consonant system 
iS much Mre coriplicatea tur the Finnish one, and that the vci^ifss - 
voiced, or fct:! =- ipnis .i:,.;rct-in has ,1 .;urn -r^^;^nt,,. .ijo,,,,, 
:n tnglish than in Finn is f,. 

'.i, Gfnara! Lons i -ora t 1 ons 

Whin tnt observed differenyes between the FnqMsn and F-nnish sonSDnant 
systerni; are taken into iccount, it can he ortdirtpd .vith tertointy that 
Mnnisn.spea.;'n.3 zenD)e will ^eet niany „-«at difflcuUies in ho^h Droducim 
.nd .Prcei.,r.^ ... correctly, Tne o,-,,i,n U.6y ,vas initiated 

Mcaus^ of a need to niake tnis nrediction mrs orecU^i what are tne factor'; 
conneetea w'th the fortis = lenis distinct, on that ca.se di tf ,c:il ties in 
ctimmunicition between EngMsh and Finnish people:' 

When a Finnisn-soeaMng oerson .tarti to learn thi Dronunc iation of 
Snnlish consonants, he ^ust ioim both to Produce comDlotoly new sounds 
{e.n. ,tj/ and /dv;, and to niake use of features that are distinctive 
in Fnglish but not in Finnish Sni.q. voice In /p/ - /b/, /s/ ~ /zJ). It is 
Often said 'nat learning new distinctions of old sounds is even more 
nifficult than iyarnind to Produce completely new sounds .see e.g. Gleason 
19M. 150=1^1;. ...... distinction is lucn a new distinction 

fcr the Finnish iearner. What is the fortis Ignis olstinrfion' The 
traditional te^-, of t.e ^eature. voicel ess - yo 1 c«d aistinciion. of course 
l-.ngs vocal LO^d vihr-,t,on to niind; the vocal rord- vi'^rate during the 
:ynis sounds laiso cal ieu voicM soundsj, hnt they do t vibr.t. lurinn 
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the fortis sounds (also caned voicfless sounds). It is a fact that no 
vocal cord vibration is usually observed during fortis sounds, but vocal 
cord v^bratTon is not always present during lenis sounds. This has led 
sofie phoniticians to reject the traditional terms vcUcC^h^ and vrtccrf 
and to posit the terrTis ^cita and lqhU instead (see e.g. Gimson 1966, 
32, Suom1 1976, 3-4). TheSi terms lead us to think about the myscuiar 
energy or force of articulation but, again, no empirical evidence for 
greater muscular tension during the so-called fortis consonants can be 
found. The tsrfiis Lvccctcii and vcicud, or ic^tU and Icnlb are, however, 
only kinds of abstract denominations for all the parameters that 
distinguish the fortis and Ignis consonants from each otner in different 
contexts (see Lehtonen 1972, 33). The discovery of some of thL^se 
parameters is one of the goals of this study-, the writer will attempt to 
answer the followino questions: what is most relevant to the production 
and Perception of the fortis = lenis distinction in English, and how do 
Finnish students of English succeed in producing and perceiving this 
distinction? A brief account is given below on how the problem was 
defined and what restrictions were made for the study. 

The folluwing consonant pairs were included in the study: /p/ - /b/, 
/t/ = /d/, /k/ = /g/, /f/ - /v/, /s/ - tzl, /tj/ - /dj/ and /tr/ - /6r/ . 
The consonancs were investigated in word initial, word medial and word 
final Dositions, and the main part of the study is concentrated on the 
word final position. The study concentrates mainly on single consonants, 
consonant clusters being dealt with only in connection with the word 
final position. The sounds were investlgatid only in stressed syllables 
and sentence-stressed words. Here is a brief survey of the scope of 
the study: 

1) what factors are responsible for bearing the fortis - lenis distinction 
in different contexts for a native English speaker? 

2) how can a Finnish-speaking student of English perceive this distinction? 
3} what factors are responsible for bearing the fortis - lenis distinction 
in a Finnish-speaking student's speech? Are they different from those 

of the native English speaker? 
• 4) how can a native English speaker perceive the distinction as produced 
by a Finnish student of English? 



3i 



Ss the starting pc'^l sf tna it^dy 1s 1n fact a connuni cat 1 ve 
situation where cnt q-' Che Epeakers ii Finnish and thg other English, 
and the language : i-gltsn. The final goal I5 to discover an least some 
of the ijrticylatsry and perceptual dffficulties connected with the 
foftis - lenis distinction. 

U3. TgrminolGgical Considerations 

Agreement has stiil not been reached on wnich tenris would be best 
to descri&i the so called fortis - lenis distinction. From a strictly 
phonological point of view the silection of the name for the distinctive 
feature is not very Important as long as the two sets are consistently 
kept separate by the feature. From a pedagogical point of view, however, 
it may be vn^se to choose sonit a ds tract name fur such a feature; this 
my helD to prevint th^ Unguaoe learner fron regarding a certain Dhonetic 
feature as solely responsible for carrying the feature (see Suomi 1976= 
1), In the present study, therefore, the designations 4ji.tl^ and icui 
will be used for consonants traditionally defined as -.vicUcis and vriicd, 
respectively, 

Another termirological aspect niyst also be explained. In connection 
with vowels* the tirnis cc^iid and iax will bo used rather more loosely than 
usua^. ^his netncG of traatni^nt was decided ypor, for purely practical 
reasons. In this report, the Lerm tmh& will include all English vQwels 
traditionally defined as bong's and all diphthongs; the term Ux will 
include all vowels traditionally defined as "5hort% including /m/ and /e/. 
This decision Is, of course, open to criticism, bscause it has already been 
shown that the lax vowels and /m / do not always behave m the same way 
(see e.g. Suomi 44). 

1,4. Phonetic Analysis 

Sglsction of Sseakers for the Pronunciation Test 
^wc g-Gups of ^nforrriants were sslected for the first part of the study, 
the phonetic analysis. The first group consisted of three native (British) 
speikars of English, h.. silecting the British informants, very strong 
dialectal features were avoided, and the speech of these informants can , 
oe regarded as fairly close to colloquial standard British English. This 
group will hereafter be r&fsrred to by the lettir 
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The other group conslstgd of three studints of English chosen on thi 
following criteria: male, hinnish as mother tongue and the campletion of 
the cum laude course 1n English phiMology. The Finnish group will be 
designated by the letter S (abbreviated from Students). 

1,4.2. Selection and Reading of Test Material 

In the phonetic analysis , the investigation was reitricted to certain 
temporal relations obtaining in the fortis = lenis distinction, i.e. the 
phonetic measurements v/are only concerned with durations of phonetic 
segments , 

It was decided that the test material should consist of different 
sound envi ronrnents. So it was constituted of three kinds of elements: 
single words, word groups where the word Investigated was in the middle, 
and whole sentences (sea Appendix 1). All of those formed so-called 
minimal pairs, 1,e. pairs Of words or sentences differing in one phoneme 
only. The test material was planned in this way for two reasons: firstly, 
and mainly, to receive results from as many different environments as 
possible* and secondiy, to discnver how the parameters carrying the 
fortis = lenis distinction differ in single words, word groups and 
sentences. The nymber of sentences in the test matefial Is quite large. 
This made It possible to Investigate the fortis - lenis distinction in 
an environment as close as possible to natural speech. 
The test words, word groups and sentences were written on cards so that 
there was one words word group or sentence on each card. The material 
was read from these cards by the informants, and recorded in the studio 
of the Phonetic Laboratory of the University of Jyvaskyla, using an 
AKG D-Z02 dynamic microphone and a Revox A 77 tape recorder. The recording 
speed was 19 cm per second, 

1,4.3. Analysis Procedures 

The test taoes were analysed by Fr0kj^r-Jensen ' s Trans-Pi tchmeter 
and Intensity Meter to which a 3-channel ink-writer (Mingograph) was 
attached. With this apparatys three curves were obtained , namely the 
duplex-DScilloqram, an intensity curve with high=pass filtering at 500 cps 
for recording the segmental differences of higher frequency, and anather 
intensity curve with low-pass filtering at 500 cps, which clearly indi- 
cates the presince or absence of fundamental frequency. The paper speed 
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used in the Hingograph was 10D m per second, thus giving I mm of Minyo- 
graph papir ^ l/lOn sec. Measurements were made to the nearest 0,5 mm, 
repreienting 5 msec on the tl^ne scale, A fev/ samples were also analysed 
by a Kay Elemetrlcs SOS] B sound spectrogriph . 

1.4.4, Test Parameters 

On the basis of the three curves described above, the following 
paranietfrs expressing temporai relations betv/een acoustic events were 
measured^ 

("r^td LnitUL tow^xmu. The release duration (RD) of stops and 
affricates, the friction duration (FD) of fricatives, and the extent of 
voicing (EV) of stops, affricates and fricatives. And finally, the duration 
Of the stressed vowel (VD) in the categories t^^ue and tax ^ i , U! , 

^c\d mdLaC coiiit^iianCi. Tne silent interval (SI) \ the release 
duration (RD) and the total duration (TD) of stops and affricates, the 
friction duration (FD) of fricatives, and the extent of voicing (EV) in 
all the sound categories mentioned above. The duration of the preceding 
vowel (VD). 

^c\d {uwd. ^Q^onoAxty. The sHent interval (SI) - of stops and 
affricates, the release duration (PD) of arfricates, the friction duration 
(FD) of fricatives^ the extent of voicing (EV) in the Sound categories 
mentioned, the duration of the preceding vowel (VD) in the categories 
ti^j-uc and la.\ - foJ ^ and the duration of the consonant preceding 

a word finil conSpnant (PCD). 

1.4,5. Statistical Analysis 

The results were analysed statistically in the following way-. First, 
the arithmetical means (5) and standard deviations (s) for the parameters 
in the different word positions were Cilculated for each group from the 
original data of each infonnant. This means that all between-subjict 
differences have been neglected, and each group is treated as homogeneous. 

There are differences in the Qverall rate of speech between speakers* 



1 . . . 

in connection with voiced lenis consonants the silent interval is reoiaced 
by closure duration (CD). " - 



It was noticid in this study that the avirage rate of speech of the 
Finnish infonnants speaking English was somewhat slower than that of 
the English ones. These random variations v/ould have drastic effects 
on the results of a study of temporal relations like the present ons. 
if straightfor-i^/ard statistical comparisons bcftccc>i the groups were 
made. Consequently, the statistical comparisons have been made mJJiin 
each group, i.e. how the Darameters affecting the fortis = lenis distlnctior 
have been realized within each group. Accordingly, the durational differ^ 
ences due to different linguistic categories which affect the fortis 
lenis distinction in different positions^ have been tested for statistical 
significance by the t-test of independent means within the two groups. 
The significance of the durational differences has been indicated in the 
fol iowl ng way ; 

+ - almost significant (P*^^51) 
++ - significant 
+++ ^ (P<KO 



++++ = (p<n,5-o^ 



strongly significant 



2. RESULTS OF THE PHONETIC ANALVSIS 

2.1. Presentation of Data 

In the tables below, the results of the phonetic analysis have 
been pooled from the data received from slncjle words, word groups and 
sentences, and different places of articulation. So different places 
of articulation are not treated separatelyi although in a later section 
a brief survey will be given of the realization of the fortis - lenis 
distinction in the three different environments. 

The tables below present the arithmetical means (X) of the test 
parameters for each group of informants, the standard deviation (s), 
the number of instances (N), the difference of the arithmetical means 
(X^-S^), and the statistical significance of the difference (p) as 
Spfcified by the t-test. 

2.2. Word Initial Position 

Tables 1 - 2 present the data on the elements Influenced by the 
word initial fortis- lenis distinction. 
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Tables 1 = 2. Wi^-ui outmC coiihonantt. Parafneters release duration fPD) 
of stops and affricatts» friction duration (FD) of frfcitivesthi extent 
of voicing (EV) of stopi, affricatis and fricativss, and the duration of 
the strissed vowel (VD) in the catigorlis /tense/ and /lax + m e/ " 
Abbreviationi: /affr/ . affricate. /ff1c/ ^ fricative, f ^ fortis/l - 



Table 

RD /stop/ 
EV /stop/ 



Thi English informants (Group E) 



53 
0 



data lose 



^1 



19 
98 



data lost 



44 
-98 



+++ h 
++++ 



RD /affr/ 
EV /affr/ 



107 



50 



57 



++++ 



FD /fr1c/ 
EV /fric/ 



156 
0 



S7 
77 



7a 
-77 



VO /tense 
VD /lax + 



e/ as 



ss 

35 



31 
16 



231 
96 



59 

30 



31 

16 



6 



non 
non 



Table 2. The Finnish infof^-nants (Group S) 

5^ X^^X^ p 

/Stop/ 84 data lost^ g data lost 76 

^8 106 ' -88- - .... 



RD /affr/ IQS 

EV /affr/ 0 

FD /fric/ ]57 

EV /fric/ 0 



70 ^ " = 35 



49 - " = -49 ++++ 

119 = " ^ 35 non 

90 = .go ++++ 



VO /tense/ ggj 58 28 273 30 28 ^10 

VD^/lax . e/ izn ig 12 99 18 12 21 

Due to an accident, the data on the standard deviations and soma instance 
of the oaranigters release duration, friction duration and extent 0 voicing 
were lost and cannot be presented here. 
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U^L'^.-f iiiitiaX btc]; ccui.':^ - Both thi rgl^^se duration ind tne 
extent of voicinq show stroi'iyi> significant differences between the 
categories /fortis/ and /lenis/. The release duration of fortis stops 
in the grouos E and $ is considerably longer than that of lenis stops* 
The fortis stops of the English informants are completely voicilesSj 
whereas occasional voicing is present in the fortis stops of the Finnish 
'nformants. Gimson (1966il48) advises foreign learners to pay particular 
attention to aspiration in an initial position in accented syllableSi 
the English listener may interpret lack of aspiration In such position 
as a mark of the lenis consonant. As in Suomi 's study on English 
stops* the Finnish students have "overshot" the rnodel production of the 
English speakers * and the difference in the release duration between 
fortis and lenis stops is even more drastic with group S than with group 
E. This clearly shows that Mazzaralla (1971:19) tells only half the 
truth in maintaining that failure to aspirate /ptk/ would be one of the 
greatest difficulties fc Finnish learner. In Suonii's study mentioned 
above, the less advanced Finnish group (with no further studies in English 
after Sirhool ) failed to produce the difference of release duration between 
fortis and lenis stops as fnarkedly as the English speakers. On the other 
hand* the more advanced Finnish group* (the same as group S in the present 
study) exceeded the model given by the Enr ' . sh group (see Suomi 1976M8=19). 
The present writer offers the following explanation to this phenomenon. 

At comprehensive School and at high school the teaching of the required 
grammar and vocabulary takes so much time that teaching of pronunciation 
is very limited. So an individual with no studies of English after school 
has therefore perhaps never been taught to produce the difference between 
fortis and lenis stops with the help of aspiration. He also fails to 
produce word initial stops with sufficient voice* as was made apparent in 
Suoml's study* The more advanced Finnish group, on the other hand, has 
received systematic teaching of pronunciation at the university. They have 
been made aware of the phenomena cLsplhcLtion and voccc. Now they kno^o that 
/ptk/ are distinguished from /bdg/ through aspiration and voice, and they 
apply these new concepts so strongly that they exceed the model given by 
their English teachers. Mazzarella's point that Finnish learners fail to 
produce aspirated /ptk/ is therefore correct only so far as a certain group 
of learners is concerned. 

C7 ( 
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Many Cfxtbooks maintain* and for the nost part correctly, thyt initia 
/bdg/ are often fully or partially devolced (sfg s,n, Jones l^fJiiUp, 
Gimson 1966:174). AlthQugh Initial stops can be devoictd, ^.he extent of 
voicing shows a strongly significant difference in both groups. Againi 
as in Saomi's study, the Finnish group exceeded the model -siven by th€ 
English group. According to Jones (1569^4) foreign liarners often tend 
to substitute unaspi rated fortis stops for the corresponding lenis ones. 

It is clear that this statemint only holds good for a certain group 
of learners (the less advinced, with no studies of English after schools 
see Suomi 1976:18), fhweviers group 5 showsd a clear pronunciation mistake 
at this point; occasioral voicing of fortis stops. This shows that lack of 
aspiration can sometimis efnerge even at university level. This mist^^f 
should be eliminatedt the data of the English group ^^how that fartis 
consonants in English are typicalTy voiceless thrnyghout. 

Wold uiltAal a^^^icAto coiUotiantL , — In the word initial affricate 
consonants also, both release duration and the extent of voicing shew 
strongly significant differences in both informant groups betv/een the 
categories /fortis/ and /lenis/* In group the difference in release 
duration Is less marked, but still significant. Even in affricates, the 
release duration (the duration of the fricative element) is significantly 
shorter in the lenis consonants* This is probably due to the fact that 
fortis consonants tend to be generally longer than the corresponding lenis 
ones (see, B>q. Finteft S Seines 1971 :33)', duration may be used as a cue 
for voicing. In fact, duration may be a very important cue at this point, 
because it seems that initial affricates are not so strongly voiced as 
e,g, initial stops. Especially In group the extent of voicing is even 
shorter than the release duration. It is possible that the notably small 
amount of voicing in the lenis affricates may cause misunderstanding. 
*"ni voicing difficulty mentioned by Jones may be even stronger here than 
in connection with the initial stop consonants. The fortis ^ lenis 
distinction of word initial affricates Is exemplified by oscillograms in 
Figure 1. ' 

Wijfn initAal ft^cat<'uc ccmonanth. The duration of the consonant 
itself (FD) se«3nis to have some value as a perception cue, the fortis 
fricatives a*'^ considerably longer than the lenis ones in group £. The 
Finnish group, however^ has failed to differentiate the duration between 

^ Figures 1 - 18 In Appendix 4. 
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""rtis end lenis f^ ' natives rnarkedTy fncugh for the difffrenci to be 
significant. Voir/ing seenis to be perhaps the most Important perception 
cue ^er'c^ both ^rmm show clear and strongly significant differences. 

ihs overaU 'Drrgir duration of fortis consonants biing a more or 
^e&5 importisrr ye^ption cue In all positions may call for some special 
antenCiC'^ example, lenls fricatives have an overall lower intensity 

•han l<^fi^z fricatives. This is connected with the force of articulations 

IDY force - ^fgher Intensity = longer duration, and less force - 
)m^^r intensity shorter duration, Halicot (1§70; 1591 - 1592) maintains 
th^t the force of articulation is a case of synesthesia. It has been 
prDved that force of articulation has little or nothing to do with 
3^r1culatory energy; it is the feeling that fortis consonants are produced 
-Mth greater articulatory force than lenis ones that makes the fortis 
con^nants somewhat longer. The Finnish Infonriants have felled to make 
the differe. jf duration marked enough. 

'^'^is might be explained by the fact that the duration of initial 
fricatives has no distinctive role in Finnish. The distinction In actual 
voicing, on the other hand, is clear because the Finnish students of 
English are aware that fortis and lenis fricatives are distinguished by, 
among other things, voice. The fortis ^ lenis distinction of word Initial 
fricatives is further exemplified through oscillograms in Figure 2, 

As for vowel duration after word Initial consonantSi the duration 
of the stressed vowel does not present significant differences in either 
vowel category. This is the reason for vowel duration not being specified 
for the three consonant classes, but* instead, for the results being pooled 
from the different manners of articulation. Vowels do not even appear 
to be systematlcany longer after fortis consonantSj as was the case 
in Suomi's study (cf, Suomi 1976;18). This is probably due to the fact 
that Suomi 's study was only on stops and the segmentation techniques 
used in the case of ^ stop consonants^ when the duration of the vowel is 
measured from the Instant that voicing begins after Initial aspiration, 
vowels will naturally be shortened after strongly aspirated fortis stops 
(ibid: 23), When, however, other consonant classes are included, the 
difference In vowel duration will become smaller and actually disappear. 
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2,1. Word Mtdial Position 

The results of the presgnt Study for the word medial consonants are 
prisented in Tables 3-4, 

Wo-td mgd^Cal ^tnp ccn^omnt^. In Lisker's experlinents (1957-42-49), 
the average closure duration of /p/ was 120 msec and that of /bf 75 msec, 
W^hen the closure duration was reduced from 130 to 65 msec, listeners 
reported hearing /b/ Instead of /p/» When the duration of the included 
stop was varied in 10 msec steps over a 40 - 150 msec range^ the closuri 
duration for which opinions were divided equally between /p/ and /b/ lay 
between 70 and BO msec, Lisker maintains that closure duration 1s a major 
cue for the fort is — lenls distinction in intervocalic stops. He concludes 
that closure durations shorter than 75 msec will always be heard as leniSi 
and closure durations longer than 130 msec will always be heard as fortis= 

The results obtained by the present writer do not agree wfth the 
foregoing conclusion. Instead * they correspond very closely with the 
results obtained by Suomi (1976i29); the difference in the silent 
interval is nonsignificant, although the silent interval of fortis stops 
is somewhat longer than the closure duration of lenis stops* On the basis 
of the present study at least we must therefore reject the idea that closure 
duration would be an important perception cue for the fortis - lenis 
distinction of word medial stops. As regards release duration, SI is and 
Cohen (1969:85) report that the noise burst of lenis stops has a shorter 
duration than that of fortis stops. This is proved by the results of the 
present studyi the release duration of fortis stops is longer. As for 
the total duration f the difference between fortis and lenis stops is 
strongly significant in group but only almost significant in group 
Now the total duration is essentially a combination of the closure and 
release durations; the Finnish group has longer closure durations but 
shorter release durations than the English group; when these two parameters 
are combined, the difference in the combination becomes more marked in 
group E. The most important perception cue to the fortis — lenis distinction 
of word medial stops seems to be voicing. This corresponds closely with 
previous experiments and published sources on the subject ; lenis stops tend 
to be fully voiced in intervocalic position, 

Wcfid mQ.dicut a^^^cote con&omn^. Contrary to the case with stops , 
the difference in the duration of the stop element {SI) between fortis and 
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Tables 3-4. Wcxd m^di^i cott^cmnU. Pgramelers si lent interval \SI), 
relsass dy rati on (RD) and the e^.tant of wic.wq j^EV) of stops a^d 
affricates, friction duration (FD] and th^ txtMVit of 'yofcing of fricatives, 
and the duration of chs pfsciding vowtl (VD) fh the categoriis /tense/ .^ind 
/lax + e/ and the statistical significance of the difference of mgan&. 



Tjfeje 3. Group E, 

fortes 







f, 
ft 


s 


n 


51 


/Stop/ 


73 


2.'3 


12 


RD 


/Stop/ 


44 


21 


12 


TD 


/stop/ 


114 


2B 


1? 


EV 


/stop/ 


0 


0 


12 


SI 


/affr/ 


80 


10 


5 


RD 


/affr/ 


93 


15 


6 


TO 


/affr/ 


173 


21 


6 


EV 


/affr/ 


Q 


0 


A 


FD 


/fric/ 


139 


34 




EV 


/fric/ 


0 


r 




CD 


/tense/ 


lis 


32 


:5 


VD 


/lax + ae, 


.e/ SB 


39 


15 





ienes 




di fferjBnce of means 


5 






p 


54 


12 


12 


19 


non 


11 


g 


12 


33 




SS 


14 


12 




+++ 


65 


15 


12 




++++ 


48 


3 


6 


32 




53 


12 


6 


40 


4 + 


102 


10 


6 


71 




43 


?9 


6 






76 




12 


63 


+ + ^ 


71 


16 


12 


-71 


+ + + + 


148 


17 


15 


-30 




110 


36 


15 


-30 


non 



Table 4. Group 5. 

fortes lenes difference of nieans 





% 


s 


N 


% 


s 


N 


Mi ^ 


P 


St /stop/ 


98 


41 


12 


79 


28 


12 


19 


non 


RD /stop/ 


28 


n 


12 


0 


0 


12 


23 


++++ 


TD /stop/ 


127 


41 


12 


79 


28 


12 


48 


+ 


EV /stop/ 


0 


0 


12 


79 


28 


12 


-79 


++++ 


SI /affr/ 


63 


6 


6 


83 


15 


6 


-20 


non 


RD /affr/ 


125 


13 


6 


75 


22 


6 


50 


++ 


TD /affr/ 


188 


10 


6 


158 


3D 


6 


30 


non 


EV /affr/ 


0 


0 


6 


158 


30 


6 


-158 


++++ 


FD /fric/ 


143 


13 


12 


ion 


35 


12 


43 


++ 


EV /fric/ 


0 


0 


12 


80 


52 


12 


-80 




VD /tense/ 


194 


45 


15 


177 


39 


15 


17 


non 


VD /lax + m 


US/1Z3 


65 


15 


153 


58 


15 


-30 


non 
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founds i and 



nnish lea 



TFie r 



stfpngly slgnlfiicant In group Group S has failed 
articulation^ the reason for thli is difficult to 
^ mfgftt be that affricatei are strange sounds for 
iey niight not know' how to ^'copi with" these new 

3ads to the difficulties r^v/ialed herej, 

jrfttlon, i.e. the hmorganiQ fricative elimint, of the 



fortls ai Is significantly loinger in both groups. As for the 

,^;.T-i^= fortis affricatiis are considerably longer than their 
7£ , .arts in group In the Finnish group the diffirence 

significant. This is explained by the duration of the stop 
.)s Which was longer in the lenis affricates although the 
should have been the case. 
^ fextent of voicing is again the tnost important pirception cue: 
are strongly lignT-ficant differences in both groups. However, 
"iniiish gy'oup's lenis affricates are voiced throughout, but under 
f the total durations of the English group s lenis affricates are 
lOiced. Th& fgrtis — lenis distinction is still carried by a strongly 
s'ignif leant difference in voicing , because the fortis affricates are 
-Qnipletely voiceless * but it is difficult to find an explanation to 
the powerful devoicing in the English group. 

This subject may be concluded by stating that all four parameters 
(silent Interval i release duratisnj total duration and the extent of 
voicing) are important to the fortis ^ lenis distinction of word medial 
affricates; the most important of these is the voicing of lenis affricates. 

CVatd ms^diai ^nsLt&tixtt comonarLt^, In intervocalic position the total 
durations of all conscnants can be measured, we can also compare the 
durations in different consonant classes (see Tables 5-6 below). The 
lenis fricatives of the Finnish group are slightly longer than these 
of the English group, but the difference is still significant. The 
differences of voicing are strongly significant in both groups, and 
fortis fricatives are compleCely voiceless. Although not even word medial 
fricatives are fully voiced^ this has no consequence as fas as correct 
perception is concerned. Conclusion: both consonant duration and voicing 
are important to the fortis — lenis distinction of word medial fricatives* 
vi icing being perhaps the most prominent of these. Slis and Cohen { 1969! 
BBS report a mean difference of 50 msec in length between fortis and lenii 
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fricatives in Dutch; here the difference is 63 msec for English, and 1t 
is StfOngly ilgnificint. 

Vowels preceding ward midfal fortis consonants are somewhat shorter 
than those preceding lenis consonanti in the same position. The difference 
in duration is nonsignificant, however, and thus the conclusion can easily 
be drawn that this differenci In vo.^el duration has no value as S phonetic 
cue. 

Tables 5-6. U^td medial coruonants . Comparisons of the total durations 
of intervocalic stops, fricatives and affricates in the categories /fortis/ 
and /lenis/. 

Table 5. Group 







fortes 






lenes 




di f f iren';i 


of means 




t 


s 




% 




N 




P 


TD /stop/ 


114 


25 


12 


58 


14 


12 


46 


+++ 


FD /fric/ 


139 


34 


1? 


76 


17 


12 


63 


+++ 


TO /a^fr/ 


173 


21 


6 


102 


10 


6 


71 


+++ 


Tgble 6. 


Group S. 




















fortes 






lenes 




di f ference 


of means 




X 




N 




s 


U 




P 


TO /stop/ 


ni 


41 


IZ 


79 


28 


12 


48 


+ 


FD /fric/ 


143 


13 


12 


lOO 


35 


IZ 


43 




TD /affr/ 


188 


10 


6 


158 


30 


6 


30 


non 



The ratios .of fortis and lenis consonants in the three manners of 
articulation are very close to each other in group £: fortis stops are 40,4% 
longer than lenis stops, fortis fricatives 45*31 longer than lenis fricatives, 
and fortis affricates 411 longer than lenis affricates. As for the durations 
of the different consonant classes, stops are the shurtest atid affricates the 
longest, fricatives taking an intermediate position* The corresponding 
percentages between the duration of fortis and lenis consonants in group S 
are 37,81 for stops, 30,1^ for fricatives and only 16% for affricates, where 
the difference of duration has remained nonsignificant.. Stops are also the 
shortest in group S, fricatives intermediate and at^ricatei the longest in 
both the categories /fortis/ and /lenis/. The absolute durations of all 
intervncalic consonants are invariably longer in group S than in group E, 



103 



Thi fortls linls distinction or word niedial conscnants is furthtr 
iximpllffid throygh oscillograms in Figures 3-5. 



2.4. Word Final Position^ Word Final Consonants Preceded by a Vowel 

The results of the phonetic analysis, as regards word final consonants 
Dreceded by a vowel* are suniTiarized in Tables 7-8. Here the duration 
of the preceding vowel (VD) has been calculated separately for stop and 
fricative consonants. Unfortunately, due to the nature of the ^=est 
naterial, it was not possible to gather information on vowel duration 
oefore affricate consonants. If the preceding vowel was preceded by 
/r/, such a boundary did not, in fact, exist; /r/ is a glide, and its 
boundaries cannot be determined. 

There is no information on the duration of lax vowels before 
fricatives, because no satisfactory test materia! was found. 



Tables 7-8, U't'ti ilml cotihonanth p-iecc^cd btj a iTtccl. Parameters 
silent interval (SI) and the extend of voicing (EV) of stops, the duration 
of the preceding vowel (VD) in the categories /tense/ and /lax+ee , e/ 
sefore stops; silent interval, release duration (RD), total duration (TD) 
and the extent of voicing of affricates; friction duration (FD) and the 
extent of voicing of fricatives, and the duration of the preceding 
/tense/ vowel before fricatives; the duration of the preceding vowel 
before word final consonants In general in the categories /tf:nse/ snd 
/lax + £,e ^ . 

Table 7. Group E. 

fortes lenes difference of means 
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/stop/ 




124 


33 


57 


86 


31 


57 
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/stop/ 
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37 


57 
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++++ 


VD 


/tense/ 




167 


34 


21 


268 


37 


21 


-101 


++++ 


VD 


/lax + m 


, e/ 


120 


47 


36 


169 


52 


36 


^49 


+++ 


SI 


/affr/ 




117 


3S 


12 


100 


^2 


12 


17 


non 


RD 


/affr/ 




209 


64 


12 


188 


52 


12 


21 


noh 


TD 


/affr/ 




286 


65 


12 


266 


65 


12 


20 


non 


EV 


/affr/ 




0 


0 


12 


65 


65 


12 


-65 


++++ 


FO 


/fr1c/ 




213 


94 


36 


127 


49 


36 


86 


+++ 


EV 


/ffic/ 




0 


Q 


36 


35 


47 


36 


-36 


++++ 


VD 


/tense/ 




157 


37 


30 


296 


58 


30 


-139 


++++ 


VD 


/tense/ 




157 


36 


66 


282 


60 


66 


-125 


++++ 


VD 


/la5f + m , 




108 


38 


69 


162 


55 


69 


-54 


++++ 
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Table S. Group 

fortes lines difference Of ineans % 
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N 


X 


s 






P 


SI 


/stop/ 


113 


31 


57 


96 


30 


57 


17 


+ 


EV 


/itDp/ 


0 


0 


57 


71 


42 


57 


-57 


++++ 


vo 


/tense/ 


222 


34 


21 


233 


54 


21 


-11 


non 


VD 


/lax + ae. 


,e/ 139 


40 


36 


174 


41 


36 


-35 


++ 


SI 


/affr/ 


83 


21 


12 


116 


28 


12 


-33 


+ 


RD 


/affr/ 


218 


44 


12 


165 


18 


12 


53 




TD 


/affr/ 


300 


65 


12 


271 


21 


M 


29 




EV 


/affr/ 


0 


0 


IZ 


229 


74 


12 


'229 




FD 


/frir/ 


148 


71 


36 


124 


45 


36 


24 


non 


EV 


/fric/ 


0 


0 


36 


62 


78 


36 


-62 




VD 


/tensi/ 


237 


64 


30 


252 


84 


30 


-15 


non 


VD 


/tense/ 


234 


54 


66 


247 


75 


66 


-13 


non 


VD 


/lax + ie . 


, e/123 


38 


69 


165 


64 


69 


'42 





Woi.d jimxl I ./ coiitonant^. With one exception, all the parameters of 
stop consonants present more or less significant differinces between fortis 
and lems stops in both groups. The difference In voicing also exists in 
this position; this is mainly due to the fact that fortis stops are com- 
pletely voiceless. The degree of voicing varies according to their*: pos- 
ition 1n the utterance: there is less voicing in utterance final position ^ 
i,e. preceding silencej or before a voiceless sound. Mor& voicing can be 
expected In the middle of an utterance, before a vowel or a voiced co^son^ 
ant. As might be expected , vowels are considerably longer before lenis 
stops in both vowel categories In the English group. The Finnish groupj 
however, has succeeded In making this difference only with the lax vowels, 
and even here to a lesser degree than the English speakers. The fortis — 
lenis distinction of word final stops is shown through oscillogranis in 
Figures 6-9, 

Woh^d ^inat ai^n^icata aotuonant& . Group E presents a significant 
difference only in the voicing parameter of word final affricates « Group 
On the other hand, has succeeded in making more or less significant differ^ 
ences in all parameters. However, the greatest difference in group 5 is 
also in the voicing parameter. Another thing Is the great degree of 
devoicing In the lenis affricates of group Only 241 of the total 
duration Is covered by voicingi In the Finnish group, the corresponding 
percintage is 851. 
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Thire is no Infonnitton on the mian duration of vowels before 
word final affricates, but on the basia of the oscillograms in Figure 10 
with considerable certainty the conclusion can be drawn that vowels 
are lengthened before word final lenis affricates » and the longer duration 
of the preceding vowel is an important parameter of the fortis = lenii 
distinction. 

Wond ilmJL i/UaaM\^t comonanth . — Here again, the differences 
between fortis and linis consonants are strongly significant in all the 
parameters in group Ei fortis fricitives are considerably longer than 
lenis oneSj there is a clear distinction of voicing because fortis 
fricatives are completely voicelesSi and tense vowels are longer whin 
preceding lenis fricatives. Group S has failed to make sufficiently 
sharp distinctions i except for the voicing parameter. As regards friction 
duration and preceding vowel durationj the distinctions made by the 
Finnish speakers are in the right direction i the differences are negligible 
and not sufficiently marked. 

The fortis - lenis distinction of word final fricatives is shown 
through oscinograms in Figure 11. 

CiosQA txaitiiriaZion thg mrid ilmt ionZlh ^ t^tiU dUtlnc.tJ^on, 
and tht p^onanQlaMon dJ^ii^mttie^ oi ¥4.nnl6k apeafeeA, — There are two 
parameters in the speech of the Engli^ih informants that always present 
significant differences between word final fortis and lenis consonants - 
vowel duration and the extent of voicing* The degree of voicing of word 
final lenis consonants depends on the position of the word in the 
utterance: word final lenis consonants can be more or less voiced, or 
even completely voiceless. The duration of the closure period is 
compared to the extent of voicingi in affricates the total duration and 
in fricatives the friction duration in Table 9 below. 

In group the degree of voicing varies considerably between the 
different manners of articulation** it is greatest in stops and smallest 
in affricates, the fricatives taking an intenriediate position. In 
group B the degree of voicing is greatest in affricates and smallest 
in fricatives 4 and stops are now in the Intermediate position. The 
differencei between word final lenis consonant duration and the extent 
of voicing are strongly significant in all consonant classes in group E; 
the Finnish group tend to use more voice, so that the differences are 
less marked, and at one point even nonsignificant. 
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Table 9, Word final cpnionants. Comparison of word fina^ lenii consonant 
duration and the extent of voicing in stops, affricates wnd fricatives, 

consonant duration extent of voice difverence of means 
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-CD t! 
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group E 
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36 


54 


54 


37 


54 
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96 


3D 


54 


71 


42 


54 
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X 
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N 


■TD ^EV 
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group E 


266 


65 


12 


65 


65 


12 


201 


++++ 


group S 


271 


21 


12 


229 


74 


12 


4? 


non 




X 


s 


N 


X 


s 


N 


■ FD ^EV 


P 


group E 


127 


49 


36 


35 


47 


36 


92 


++++ 


group S 


124 


45 


36 


62 


78 


36 


62 


+++ 



The duration of the preciding vowel is the most important and most 
ccnsistent parameter of the word final fortis = lenis distinction. This 
is also confinned by the general tendency that while the tnglish group 
has strongly significant differences in both vowel categories, the Finnish 
group has succeeded in making the distinction sufficiently mar'-ed for th;> 
lax vowels only. 

In some experiments on the fricativ.^s /s/ and /z/ Dunes (1955) dis- 
covered that the effect of vowel duration is not independent of the dur- 
ation of the following consonant, and vice-versa. The perception of voiciiig 
in the final consonant segment increases as the ratio of the duration of th 
final consonant to the preceding vawel decreases: the longer the vowel and 
shorter the consonant - the more the perception of voicing. This indicates 
that, although preceding vowel duration is perhaps the most important cue 
to the word final fortis = lenis distinction, the distinction is most 
strongly carried by a combination of features, and one Of these being thrft 
the unvoiced lenis consonants still remain lenis ^ e.g. are shorter in 
duration than the con esponding fortis consonants. This is noticeable as 
regards the English inforniants in the present study also', only affrica^ps 
form an exception. Raphael (1972:1296 - 1303) discovered that synthesized 
words with cues for voicing In the final consonant segment were perceived 
ai voicid following vowels which were of shorter duration than those with 
cues for voicelessness. Raphael concludes that preceding vgwel duration 
is a sufficient and necessary cue to the perception of the voicing of word 
final lenis consonants. The presence of voicing in the consonant s©gm*-jnc 
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does not have the same value , hut it makes correct perception easier* 

The quiitlDn whether the lengthening of vowels before voiced conso= 
nants Is a phenomenon specific to the English Imguage only, or whether 
VQwils are universally longer in this position was Investigated by 
Chen (1970:129 - 159). He examined the effect of the fortis - lenis 
distinction in English^ French, Russian and Korean. In all four languages 
a vowel was invariably longer before a lenis consonant than before a 
fortis one, A certain cross-linguistic validity of the statement 
that vowels generally tend to become longer before lenis consonants 
seems to exist, However » In Chen's experiments the voicing of the 
adjacent consonant influenced Its preceding vowel to different degrees 
in different languages: in English the ratio of vowel duration before 
fortis consonants to that before lenis ones was OsSl, and In Spanish the 
ratio was 0 37. Chen concludes that the variation of vowel duration as 
a function of the voicing of the following consonant 1*" presumably a 
language-universal phenomenon. The extent, however, to which the 
adjacent fortis or lenis consonant^ affects the duration of Its preceding 
vowel is determined by a language-specific phonological structure. For 
a more extensive revievi of recent work see Hanntnen 1978. 

From a linguistic point of vieWs the variation of vowel length in 
Inverse proportion to the closure time of the following consonant might 
reflect the speaker's tendency to maximize the perceptual distance 
between fortis and lenis consonants; the simultaneous reduction of 
vowel duration and the prolongation of the following consonant would 
heighten the perceptual effect of volcelessness. Conversely, the 
simultaneous lengthening of vowel and shortening of consonant would 
concur as acoustic cues for voicing. Finally* although It seems that 
the lengthening of vowels before word final lenis consonants is at 
least to some extent a language-universal phenomenon, the remarkable 
durational differences in English vowels may be seen as a perceptual 
device serving a distinctive function In the phonological system of 
the English language. 

The Finnish Informants have systematically failed to make significant 
the distinction of tense vowels preceding fortis and lenis consonants* 
The differences of lax vowels In the same context, however, show some 
statistical significance. According to many published sources * It Is 
the tense vowels that are most remarkably lengthened before word final 
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3. RESULTS OF THE PERCEPTION TESTS 

The second part of the present work coniists of perception tests. 
An account will be given below on the selection of material for the tests » 
how they were recorded and how they were performed. 

3.1. Selection of Perception Test Items 

The test itenis were chosen on the basis that two kinds of perception 
tests would be made upi one consisting of words and one of sentences (for 
the items, see Appendices 2 and 3). Another criterion was that the per- 
ception tests also should concentrate on the vmrd final cases. Finally 
14 words and 24 sentences were selected for the tests, 

3.2. Recording of the Test Tapes 

The test tapes were recorded at the Phonetics laboratory of the 
University of JyvSskylS, from the recofdings made for the phonetic analy- 
sis. Each item was recorded three times , i.e, pronounced by each of the 
throe English and the three Finnish informants. In the word tests, words 
were included both as single words and from war groups; in the latter 
case I the extra words were removed and only the test word '^/as included. 
However, the perception of single words and words removed from a word group 
was treated essentially in the same way. The items were recorded in 
random order * and the word and sentence tests for Finnish and English 
subjects were recorded on different tapes. Finally* therefore, there 
were four tapes.' two word tests containing 42 items each and two sentence 
tests containing 72 items each, 

3.3. Selection of Subjects 

Three groups of subjects (listenars) were chosen. The first group 
consisted Of ten British subjects. This group will be called group £, 
There were two Finnish groups i one consisting of ten students of different 
subjects i with a 6- or 7 - year course in English at school, but no 
language studies after school. This group will be called group 5L The 
other Finnish group consisted of ten students of English who had completed 
a cum laude course in English philology. This group will be called group 
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3.4, Test Pefformanci 

The tests were perfonned in the AA - language laboratory of the 
University of Jyvaskyla. The subjects heard the stimulus words and 
sentences through earphones. Six different tests were performed, as 
f 0 1 1 ows : 



Speakers Listeners 

test: word E SI 

E S2 

SZ £ 

test: sentence E SI 

E S2 

\ S2 E 



The tests were planned as forced choice tests , where the subjects 
had to choose between two given alternatives. For exaniplei they heard 
the word "cheap" frofn the tape. They were then supposed to answer whether 
they heard "cheap" or "jeep". The answers were marked on optic reader 
fOi^S. 



3.5. Analysis Of the Results 

The results were first transferred onto a magnetic tape by an optic 
reader at the Centre for Educational Research in JyvSskyla. The tape was 
analysed by a computer at the Computer Centre of the University of Jyvas- 
kylS. Thus percentages of correct and incorrect identifications were 
obtained j for each group of subjects in the sound categories and word 
positions included in the tests. 

In the following tables the results will be presented so that the 
word and sentence tests are dealt with separately. However, a comparison 
will be made between these two test types where possible. The percentages 
of correct identifications out of the total number of cases will be given 
for each group of subjects and for fortis and lenis consonants in different 
poiltions in a word. The following abbreviations are used: E ^ English; 
SI = the less advanced Finnish group, with no studies in English after 
school i S2 = the more advanced Finnish group, with a completed cum laude 
course in English philology* 
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3.6. Word Tests 

t^oid ZiujUat cLOiUoiuXiiU , — Two sound pairi were choien for the word 
Initial posftton, viz. the stops /t/ - /d/ and the affricates /t// - 
/dj/. In Table Z4 below the results are presented so that first the 
percentages of correct Identifications are given for all word Initial 
consonants 1n the test, and then the separate results of stops and affri- 
cates are specified. 

Table 24. Percentages of correct identifications of word Initial conson- 
ants in word tests, SI - the less advanced Finnish students, S2 - the 
mare advanced Finnish students of English, 

Speakers Finnish S2 English English 

Listeners English Finnish Si Finnish 52 

fei ted 

^o«^tis 95,8 95 100 

lenis 83,3 90 98,3 

/t/ 95,8 100 100 

/d/ 100 100 TOO 

^t// ^5,8 90 100 

/OJ/ 66,7 80 96 J 

A first general glance at the table above reveals that the number 
of correct identifications Is fairly high: over 90^, with some exceptions. 
The first surprising feature Is perhaps the English group's relatively 
low amount of correct Identifications of lenis consonants pronounced by 
Finnish students. It Is further proved that this is caused by the 
pronunciation errors made in affricates, because the /d/-s have been 
Identified 100^ right. 

Fortis consonants appear to have been Identified better than lenis 
ones, as was expected; it more often happens that lenis consonants are 
taken for fortis ones than v1ce*versa. 

There Is a clear difference between the two manners of articulation- 
stops have been identified far better than affricates. In fact, an 
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□vera]] correct Identification of ]QQ'^ was expectid for the stops, but for 
some reason this expectation did not materialise* What is even more 
Surprising is that the errors were made by group E in identifying 
The fewer correct identifications in affricates is probably caused by 
the fact that affricates are completely strange sounds for FinnSi they 
are unaccustomed to distinguish them both in pronunciation and perciption. 

If we compare the three listener groups * group S2 has done best. 
Only in two cases out of six is the percentage of correct identifications 
less than 1001, and even in these cases the amount of errors is very small. 
Group Sf comes second* while most mi sunders tanding is found in group E. 
This is, of course* to be expected = From the results of the phonetis 
analysis we know that the Finnish speakers do not always produce the 
fortis ^ lenis distinction in the same way as the English ones. This, 
in turn* means that the English listenirs do not always distinguiih what 
the Finnish speakirs mean to say. 

Wcf^d mudisii conhonant^ , Only one sound pair was included in the 
word tests as an example of the perception of word medial consonants * 
namely the affricates /t// - /dj/. The word pair where thi sounds appeared 
was a subminimal b/te£tAe4 - bA^idgt4* This pair was included because the 
phonetic analysis revealed some interesting facts concerning the duration 
of the stressed vowel in the words. The results of the perception test 
are presented in Table below. 

Table 25. Percentages of correct identifications of word medial affricates 
In word tests. 

Speakers Finnish S2 English English 

Listeneri English Finnish SI Finnish S2 



Of special interest here is the considerable number of errors made in 
identifying the word fa^eea/ie^ correctly. The less advanced Finnish group 
{Si) has in fact made more incorrect than correct identifications. This 
can be explained by the very slight difference in the duration of the 
stressed vowel In fa^eee/itii and b%idge^. 



nf/ 



75 

95,8 



40 
100 



70 
100 




The voicing of the leni^ affricate is also very weak, and the dur- 
ational difference fairly slight (only 40 maec)* so it is not surprising 
that the word is misinterpreted. The reason for group i misinterpreting 
25 of the /t//:s Is nore complicated; it is known that Finnish speakers 
tend to make the vowel too long in this position. One possible e^^pla- 
nation is that the English listeners have misinterpreted the long vowel 
duration as a cue for the lenls affricite /dy, because the vowel is in 
fact lengthened even h.?re, in a medial position, when it is followed by 
a lenis consonant. 

The two 100 identiflcat^-ons of groups SI and M in the affricate 
/dj/ have a single eKplanation: orthography. When a Finnish speaker 
hears a vowel that he would classify as "short" (about 70 - 90 msec 
or so) in his mother tongue and simultaneously sees two words with one 
vowel grapheme in one and two in the other, he will have no hesitation 
in choosing the one with one vowel grapheme if he is asked which word 
he heard. 

The results of this mini-perception test clearly show that nothing 
is self-evident in the correct perception of English words and sentences. 
We would expect that perception would be easiest in word medial position, 
where, for example, the voicing feature, which has hitherto been perhaps 
the most prominent in the fortis - lenis distinction, is most powerful", 
but as was discovered, one must be very careful with such expectations 
and generalizations. 

Wand iime coiUi^naHti, Most items of the word tests were concen- 
trated on word final cases. The results of the test have been presented 
in Table 26 as follows: firstly, the percentages ot correct Identifi- 
cations of all word final fortis and lenis consonants have been calculated: 
then the percentages of stops, of which only one word pair, the subminimal 
tnuf - caierf was included, are presented. Fricatives have again been 
dealt with so that, firstly, the percentages for all fortis and lenis 
fricatives have been calculated; then they have been divided into single 
fricatives and fricatives preceded by /n/. The single fricatives have 
first been treated as one group, and then divided Into /$/ - /z/ and 
/f/ - /V/. As for the fricatives preceded by /n/, there is only one 
word pair, iuicc = h(n^. 
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Table 25, Percentages of correct identifications of v/ord final consonants 
in word tests. 



Speakeri 


Finnish S2 


Engl 1sh 


Engl i si 


L. 1 3 tci i S f S 


Eng 1 i sh 


Finnish SI 


Finni si 
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f or t i s 
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92, S 


1 eni s 


60)4 


51 ,2 
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/t/ 


100 


83,3 


100 


/d/ 


50 


86 j 


100 










fort is 


59,5 


60 


90 


leni s 


63*8 


46,7 


68,9 


MiiQ £c 
















fort is 


41J 


60 


85 


lenis 


95,8 


31 ,7 


50 


/s/ 


4,2 


26,7 


70 


/z/ 


91 ,2 


13,3 


36,7 


/f/ 


75 


93,3 


100 


hi 


100 


50 


63,3 










/s/ 


9S,8 


60 


100 


/I/ 


0 


56,7 


96.7 



In general fortis consDnants seem to have been identifiid better than 
Ifnis ones^ The more advanced Finnish group has again done best, the 
reasons being clear: they have been listening to mistake-free uttirances, 
and they have also been taught, at least to some extent, to recognize the 
differences from a native English speaker's speech. The less advanced 
group has made far more errorsi this points to . .nclusion that the 
teaching of pronunciation at university level . some effect, at 

least as regards ability of perception. On the ^ fhcr^ hand, it seems to 
have had less effect as regards ability of pronunciations the number of 
correct Identifications of the English listener group Is not very high, 
this, of course, being due to pronunciation mistakes made by the Finnish 
speakers. 
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The stop consonants in tKis position were iKamined only in the 
subminimal pair cojt — ^.ucfi. The result from the English group is 
somewhat contrary to e^cpectation. As has been noticed above, the Finnish 
spaakers tend to mike the vowel in eoif too long, and the /s/ too short. 
It was therefore expected that this longer vowel duration connicted with 
thf shorter duration of the /s/ would have been mistaken for the word 
liX^^di but the contrary took placet aa^ad was mistaken for csiit in 50?. 
of the cases. The more advanced Finnish group has cleared this hurdle 
without errors and the other Finnish group has also done fairly well. 

What was said above does not apply to fricatives, however. The 
figures of group E in single fricatives are as expectedi over half of 
the fortis fricatives have been identified incorrectly, this being due 
to the over-long vowels pronounced by the Finnish speakers in the words 
icc^Vf The vowel duration Is approximately the same in Cc^^Ct Caai^s 

■and nere the results are much better. It is interesting that the /s/is 
have been identified almost completely incorrectiy, whereas three quarters 
of the /f/is are correct, A possible explanation for this is that there 
is usually more voice in the final segment of i'cave. than in that of Cobc, 
especially with Finnish speakers, /z/ seems to be the consonant most 
subject to devoicing in the final position, and it very easily becomes 
complitely voiceless. The less advanced Finnish listeners have failed 
to interpret the differences of preceding vowel duration in the question 
of /s/ - 111 . With /f/ - hi the results are better, although half of 
the /v/:s are again wrong. Group %'l have also failed rather badly to 
perceive that a longer vowel duration and shorter duration of the frica- 
tive must be interpreted as the lenis /z/ - only a little over one third 
of the identifications are correct. The /s/;5 have been identified better, 
and the /f/:s 100^ fight in fact, but surprisingly many errors have been 
made in /v/. This may be due to the fact that the English speakers did 
not use as much voice in the word final /v/;s as the Finnish ones. 

The last section of this test, word final /s/ and Iz} preceded by 
/n/i offers a number of interesting points,. In identifying the word 6uice, 
the mistakes made by the Fnglish listeners are very few* This is of 
course explained by the fact that the Finnish speakers pronounced the /n/ 
sufficiently short. The word on the other hand, has without 

ej^ceptlon been identified as iuice* but this also has a simple explanation: 
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th& /n/ in ^hifi^ was not sufficiently lengthenid by the Finnish speikeri. 
The final /%/ was also made too short, and the /z/, in turn, too long. 
The only possible consequenci of these pronunciation mtstakes was that 
all the words were interpreted as iincc, with the fortis /$/ in the end. 
Group S2 hiS iucceeded very well in interpreting the different ration of 
sonofant and obitruent durations (long sonorant ^ short obstruent ^ /z/ , 
and vice-versa ^ /s/). The other Finnish group is in intermediate pos- 
ition, this going to show that they have not really b§en aware of how 
these two wordi should be distlngy ished. 

To sunmarize the results of the word tests* the word final position 
is the one where incorrect perception most frequently arises. In this 
position there are many points where Finnish speakers cannot even 
satisfactorny make the fortis ^ lenis distinction perceivable to an 
English-speaking listener. For Finnish listeners the problem is made 
easiers because they can listen to model pronunciations^ Especially In 
connection with the word final lenis consonants, however, Finnish listeners 
often fail to make the right distinction. This must be due to the fact 
that the function of sound duration Is here completely different from 
what it is usually in Finnish, and Finns cannot interpret these vari- 
ations of sound durations correctly. The less advanced Finnish group 
especially is. In a way, "out on a limb" here, because they have received 
hardly any knowledge of these kinds of variations in vowel and consonant 
durations. 

3.7. Sentence Tests 

The perception test consisting of sentences was almost completely 
concentrated on consonants in the word final position: only two sentence 
pairs were chosen for the word initial and medial positions. 

bJo^d InLtLaZ coiUonafitM. In the sentence pair chosen to represent 
the word initial positioni the sounds under investigation were the some- 
what disputed affricates /tr/ - /dr/. The percentages Of correct identi- 
fications are presented below in Table 27. 
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Table 27. PercentagiS of correct identifications of the word initlil 
affricates /tr/ and'/dr/ In sentince tests. 

Speakers Finnish S2 English English 

Listeners English Finnish SI Finnish S2 

/tr/ 79,2 70 73,3 

Mr/ 95,8 65,7 73,3 



The na1n feature carrying t^^^ ^iistinction between /tr/ and /dr/ is 
the devolclng of the /r/ after /t/; the Irl becornes a kind of voicelsis 
fricative element, and this originally gavs rise to these sounds being 
called affricates. Another distirctive feature is that /tr/ is usually 
longer than /dr/. However, it sometimes appears that the /r/ - element 
IS partially dcvcicQd after the lenis stop /d/ as weli. If the difference 
of duration is not especiany marked, the percentages of correct identi- 
fications can even be as low as they are in the tnhle above. A compar i son 
between the initial affricates /t// and /d^/ in ward tests, and the 
present sounds, shov;s that the results in the sentence test are a little 
poorer, especially in oroup Sf. However, the results are not directly 
comparable, because they are not exactly the same sounds in buth tests. 
But this test again shows that even where we suppose it self-evident 
that the words will be identified correctly, our suppositions do not 
always hold good. The word final position is the most difficult one, 
but not the only one to cause problems. 

The word medial consonants need not bo dealt with here, because it 
was discovered tha^ thi> v/ord pair decca^^: - d<scn^p ls primarily 
distinguished by the duration of the stressed vowel, and thus it will 
be covered in connection with the test results on word final consonants. 

ft'c^cV ihmt can^onmth. The perception test with most material was 
the sentence test on word final consonants. Two different sound groups 
will be covered below separatelyi first the single consonants, and then 
the consonant clusters restricted to sequences sonorant + obstruent. The 
results on single consonants are presented in Table 28, 
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Table 28, Percentages of correct identifications of mrd final single 
consonanti in sentence tests. 



Speakers 
Listeners 



Finnish S2 
Engl ish 



Engl ish 
Finnish Si 



Engl ish 
Finnish S2 



fort is 
1enis 



77,3 
68*6 



74,8 

68, a 



84,2 
77,9 



fortis 
lenis 

/P/ 
/b/ 

/t/ 
/g/ 



86.5 
78,1 

70.8 
62,5 

91.7 
85.4 

87,5 
79,2 



81 .7 
91 ,6 

80 
80 

81 J 
95 

83,3 
95,7 



86,7 
95,8 

70 
90 

95 

98,3 

86,7 
96,7 



fortis 
lenis 

/s/ 
Itl 

Ml 



65.3 
66,7 

52,1 
60,4 

79,2 



84,4 
42,2 

78,3 
43,3 

40 



88,9 
63,3 

85 

66,7 
56,7 



87,5 

100 



73,3 
86,7 



96,7 
73,3 



The niQre advanced Finnish group has again obtained the best results, 
while groups SJ and E are fairly equal, 

Fortis stops have been identified better than lenis ones by the 
English group, whereas the opposite is true of the Finnish groups. In 
the single pairs of stop consonants the same relation also obtains, with 
the exception of the /p/ - /b/ of group SI, where the percentages are 
equal. With fricatives, the relation of correct identifications is again 
opposite to that with stops: now the English subjects have identified lenis 
fricatives better, 5? cannot identify even half of the lenis fricatives 
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correctly, and even corroct Identifications only reach two thirdi 

of the total at best. The resuUs of the fortii fricatives are much 
better, C:<p1 anations for the Identification Grrnrs are easy to find, 
and they are readily provided by the q<ici Uonr^-s in Fiq, The words 

and pronounced by the Finnish inforTTiant are in fact completely 

equaU the only factor to distinguish these two words would be context of 
utterance. The same also applies to the stressed vowels and final 
consonants of the words t/ccaau' and dlh^abc. As it seems that the most 
frequent mistake of vowel duration made by Finnish speakers is that of 
making tense vowels preceding fortis fricatives too long, the natural 
consequence of this is that fortis fricatives are more often identified 
as the corresponding lenis ones by English listeners. The difference in 
the tense vowel duration before lenis fricatives was not as marked between 
the groups* although the Finnish Informants' vowels were here somewhat 
too short. This explains the reason for a greater prnpDrtiQn of the lenis 
fricatives being Identlf'ed correctly, as /z/ or /v/. However, the 
correct identifications do not conie to 100 , because the vowels are not 
lengthened sufficiently. Preceding fortis fricatives* the difference of 
vowel duration between the infor-mant groups is greater, and more perception 
errors are consequently made. Wc have already noticed that, in connection 
with word final fricatives especially, the iniportdnce of the preceding 
vov/el duration as a r.c to the fortis - lenis distinction is very important; 
the lenis fricatives ^re often very strongly devoiced 1n word final pos- 
ition, esDecially iii . 

What, then, explains" the fact that hoth Finnish listener groups have 
identified the fortis fricatives far better tlian the lenis ones? They have 
succeeded in recognizinq the fairly short tense vowel duration as a cue for 
the fortis fricative (though not in all cases), but they have failed to 
Day attention to the considerably lengthened vowels precDding 1 eni s"'f rica- 
tives, especially the less advanced group. Vowel length is known to be 
a very important distinctive feature in Finnish, but Finnish speakers are 
certainly not used to paying attention to vowel duration in this context 
They do not expect tense vowels (traditionally "long") to be subject to 
considerable variation of length in different contexts, but expect them 
to be always about twice as long as the lax (traditionally "short") vowels, 
as in Finnish. Thus the "perception filter" of their mother tongue prevents 



them from realizing thi most important feature carrying the distinction. 
It Is only natural that this pirciption filter is even stronger in group 
5J, which has not really received any systematic teaching of pronunciation. 
This is also reflected in the results: Sf has identified lenis fricatives 
better than the less advanced group. 

There is little to say about the affricates. The only case of 100% 
identification is found herei the Finnish informants have succeeded in 
making the distinction /t// - /dj/ well enough to be completely under- 
standable. The majority of the Finnish informants have realized the dis- 
tinction* 

In connection with the fricativeSt it was pointed out above that 
all the listener groups had special difficulties in identifying fricatives 
following tense vowels. Here, all the stressed vowels were tense ones. 
An opportunity is provided to compare the percenta^r^':. of correct identi- 
fications when the preceding vowel is a diphthong in one category, and 
a lax vowel in the other. The final consonant in this case is a stop. 
The results are presented in the table below. 



Table 29, Percentages of correct identifications of word final stops 

preceded by a diphthong or a lax voweU 

Speakers Finnish S2 Englisn Engl^iih^ 

Listeners English Finnish SI Finnish S2 

VLpkthong 

fortis 81,3 83,3 85 

lenis 68s8 85 95 

Lax \jow^l 

fortis 91J 80 88,3 

lenis 87,5 98,3 98,3 



Contrary to expectationi the abcve figures till us next to nothing. 
About the only thing to be said is that stops preceded by a lax vowel 
are in general identified better than those preceded by a diphthong. But 
there is no reflection here of the difficulties in both production and 
perception that were found in the fricative section of Table 28i and in 
the phonetic analysis. It seems, after alU that one must still be very 
cartful about making diphthongs directly comparable to tense vowels. 
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a'jT.t ^t»;.iC ^^>i^r,:At\Ci i-c^^>i<d Lij M'th^r: rc^ilrn^nt. Two different 
types musz be diatingu iihed here, on the basis of the nature of the 
preceding consonint. It can be in obstrugnt /s/ (or /i/) followed by 
the stops /%/ or /d/, or a sonorant /!/ (or/n/), followed /t/ or /d/, 
and .^5/ or n/ , respectively. In Table 30, where the results of this 
test are shown* the consonants have been classified into two categories 
on the basis of the final consonant, i ,e. into stops md fricatives. 
It is also Dossible, hov/ever, tn make comparisons on how perception is 
affected by the nature of the preceding consonant. It also turned 
Out tnat the most convenient wgy was to include the test words them= 
selves in the table. 



idhlc 30. Percentages of correct identifications of word final consonants 
preceded by another consonant. 



:'peaF.ers 
L TStener- 



fortis 
lenis 



Finnish S2 
;; nnl i^h 



66,; 

50 

33,3 

66,7 
33,3 



English 
Finnish S1 



^8,3 

?3.3 
66 J 

^3,3 
73,3 



English 
Finnish $2 



60 

66,7 

70 

73,3 
50 

96,7 



F'Lf cat i h 



37,5 
58,3 



73,8 
^3,3 



96,7 
76 J 



The number of correct identifications i^ here rather low- only 
a CQuDle of Dercentages over m'' can be found, and the numbers of correct 
identifications vary from one third to two thirds approximately. In a 
comDarison of the listener grouDS, S2 is again in first place, its lowest 
ffqure being 501 correct. 

The correct identifications of stops vary from 4a, 3" to 66,7'' in 
fortis stops, and from £0^ to 66,7: in lenis ones ^ almost identical 
figures. Ho-^vGr, the v=/ords poi^ - rt^icu on the one hand, and iultt 
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bLLLld on the other* are dlstinguUhed on completsly different groundi. 
In the pair pa^t - poicdi both the duration of the diphthong and the 
intirvening Qbstrugnt are affected by the fortis = lenis distinctions the 
diphthong especiany is lengthened in ps^ed- The intervening fricative 
elenient Is longer in po6t^ the differgnce was proved nonsignificant by 
the t-test (see Tables 10-11)* although the oscillograms (cf. Fig, 15) 
clearly show that It is fairly marked. In this distinction, there are tea- 
tures that are among the most difficult for a= Finnish speaker: tense 
vowel or diphthong preceding a fortis consonant , which is generally made 
too long, and a fortis fricative, which is generally made too short. 

In the pair buMU bwUd, however, It Is the Intervening sonorant 
/]/ that is affected by the fort Is- lenis distinction. It is considerably 
longer In btUldf whereas the /I/ is almost equal in duration to the /I/ 
in biJLitt, This is a third great difficulty for a Finnish speikeri he 
tends to make sonorant consonants before fortis obstruents too long, 

The above difficulties 1n pronunciation are also reflected In 
perception. Groups £ and S? especially have had noticeable difficulties. 
The English listeners have identified the fortis stops better, v'hich Is 
a little surprising, knowing that Finnish speakers especially tend to 
pronounce vowels and sonorant consonants before fortis obstruents In the 
wrong way (I.e, too short). Both the Finnish groups have identified the 
lenis stops bitter, though In St the difference between fortis and lenis 
is very small in po6t - po^ed, but larger In buLtt - biLttd, 

Finally* the fricatives preceded by /n/- First of all, the results 
of the test appear to be somewhat better than those of the stops. This 
can be at least partly explained by the fact that the test word pair 
i&nae. — ^eni appeared In utterance final position, so that advantage 
could be taken of perception cues Involved both with the preceding sounds 
and the consonants themselves. The stops appeared medially in the utter- 
ancei, in a position where the stops themselves were amalgamated with 
the following sound, the fricative /J/, so that they had to be identified 
almost solely on the basis of the preceding sounds. 

The distinction of the pair ^ence — iM Is in a way similar to buUM 
biLLld: the intervening sonorant, in this case /n/, is considerably longer 
in i^^iti i,e, preceding /z/. To this is added the varlat ' n of the dur- 
ation of the final fricative: the /s/ in iznat is consie.^ably longer 



136 



ERIC 



Figure 9. Word final stops. 

VD = vowel duration (miic) 

51 = silent interval (msecj 

= extent of voicing (msec) 

CD = consonant duration (msgc) 
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Figure 10, Word flnil affrlcatei. 

VD s vowel durition (msec) 

SI = silent Interval (msec) 

RD ^ release duration (msec) 

TD = total duration (msec) 

CD = conionant duration (msec) 

EV ^ extent of voicing (msec) 
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Figure 11. Word final frieatives. 

VD ^ vowel duration (msec) 
FD = friction duration (miec) 




(Engl ish speaker) 



(Finnish spiaker) 
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Figure 12* Word final fricatives. 

VD - vowel duration (msec) 

PCD - duration of the consonant preciding 

a word final consonant (msec) 
FD - friction duration (msec) 
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Figure 13. Word final fricotivis, 

VD = VQwel duration (msec) 

PCO ^ durition of the consonant preciding 

a word final consonant (msec) 
FD ^ friction duration (msic) 





(English speaker) 



(Finnish Speaker) 
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Figure 14. Word final stops, 

VD ^ vowil d--^tion (msec) 

PCD - duration of the consonant preceding 
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I word final consonant (msec) 
n:ient interval (msec} 
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Figure 15, Word final stops, 

VD - vowil duration (msec) 
PCD s duration of the consonant preceding 
a word ffnal consonant (msec) 
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Figure 16, Word final stopi. 

VD = vowel duration (msic) 
PCD ^ duration of the consonant prscedtng 
a word final consonant (msic) 
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Figure 17. Word flntl fricitlvis. 

VD = vowel duration (msec) 
FD - friction duration (msec) 
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Figure 18, Word final fricativii, 

VD - vowel duration (msec) 
FD - friction duration (msec) 
EV = extent of voicing (fflsec) 
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THE DISCRIHINAIION AND IDLNTlFltJATlON OF ENGLISH VOWELS, CONSONANTS, 
JUNCTURES AND SENTENCE STRESS BY FINNISH COMPREHENSIVE SCHOOL PUPILS 



REIJO LAHMINMAKI 



INTRODUCTION 

Ihis Study is an attempt to map the factors which influence the 
perception of English vowels, consonants* junctures and sentence stress 
by Finnish comprehf nsi ve school pupils. It is a shortened version of a 
report (a pro gradu thesis) which is part of the Finnish-Lngl ish Con- 
trastive Project at the English Department of the University of Jyvas- 
kyla\ The original report contains a more profound discussion of the 
background and a more detailed presentation of the results- Many problems 
similar to those dealt with In this study have been examined before. For 
example, many perceptual problems caused by English vowels and consonants 
for Finns learning Lngilsh have been traced back to the phonetic identi- 
fication cues. The evidence has, however, mainly come from word level 
studies, ihe present study attempts to map the relevance of these and 
other cues at the sentence level. 

IHE AIM OF THE STUDY 

This study is concorned with the difficulties encountered by Finnish 
comprehensive school pupils in learning English receptive pronunciation 
Skills. Because of the many-sided nature of the receptive skills, the 
study has been focused on the potential sources of difficulty caused by 
certain phenomena belonging to English phonology such as they appear in 
the network of understanding sentences. The material of the study consists 
of minimal (phonological) oppositions (eg, /I/ - /i/, /k/ - /g/) at both 
segmental and suprisegmental level. The oppositions occur in sentences 
like bid u loo kigli/HU bad Ih too kigk, ie, the sentences form 
pai rs , 

The reception difficyltles of this kind of material are studied by 
means of two different perceptual tests: the discrimination test and 
the identification test-. By means of the test results and by using cer- 
tain background variables (like the mastering of the vocabulary of the 
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identification test items and the school marks 1n different subjects) an 
attimpt Is made to answer the following questions: 

(1) riow are the mani feitations of the phonological distinctions used in 
this Study discriminated and Identified? 

(2) What factor Influence the particular discrimination and identifi= 
cation resuUSi le. the receptive process dealt with in this study? 
What are the causes of the difficulties in the reception of the 
phonological oppositions tested? 

(3) What do the discrimination and Identification tests measure? Is the 
mastering of the productive skills predictable by the discrimination 
and/or Identification test results? 

THE METHOD OF THE STUDY 

T/ie m€LtQ^%ial di& 6iudij, - From the point of view of studying the 
perceptual process, natural speech (1e. conversation) would be ideal 
material. The use of this kind of material is not, howeverj possible for 
various reasons (see Lehtonen 1977: 39) and therefore minimal pair tech- 
nique was adopted. Natural speech was approximated by using complete sen- 
tences instead of Individual words. In order to be comprehensive enough 
the material consists of both segmental (1e. vowels and consonants) and 
suprasegmental (junctures and sentence stress) aspects of the English 
phonological system. The study is based on the use of minimal phonological 
oppositions that are mostly contrast! vely (Finnish - English) selected, 
ie. oppositions found difficult for Finnish learners of English in previou 
studies are included in the material I his Is especially true of the seg- 
mental items whereas the choice of the suprasegmental material cannot be 
made by such means because empirical contrastive knowledge of the supra- 
segmental phenomena Is scanty. 

The occurrence of the minimal oppositions In sentences causes a cer- 
tain amount of selectlveness and also places some restrictions on the 
choice of the material. Firstly* the Insertion of one opposition member 
into an identical sentence frame with its counterpart eliminates the use 
of certain oppositions completely; it Is difficult to find any sentence 
frame for many oppositions. For example, the insertion of the word pairs 
leaJi - -Ceaua and LUkLto^ - wiuj dwo^a into an Identical sentence 

frame Is Impossible because the grammatical functions of the members of 
both pairs are different* Secondlyj the material should not be too dif- 
ficult for Finnish comprehensive school pupils to cope with. Thirdly, the 
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occurrence of the phonological items studied varies both in regard to 
their position in a woi^d and 1n an utterance. On the whole the choice of 
the mater^ial 1s a compromise between suitable sentence frames and con = 
trastive criteria of difficulty. 

The seiection of the vric^Ch was mainly based on Wlik's (19S5) con= 
trastive phonetic study of Finnish and English vowels. Wiik (1965: 59) 
analysed the forinant structures of ;i 1 1 the Finnish and Lnglish vowels 
by sound wave analysis. By comparing the phonetic distrlDution areas of 
Finnish and English vowels and by looKing at the results of the substi- 
tution tests administered by Wiik (196&: 64, 67, 80} certain 
English vowels could be specified as probable sources of difficulties of 
of perception (identification and discrimination) for Finnish learners 
of English. Perceptual difficulties can often be expected when the pho- 
netic distribuEion area of an English vowel correlates with two differ- 
ent Finnish vowels. 

The present study concentrated on English tense-las oppositions 
because tne perception of these oppositions seems to be especially 
troublysDine for Finnish iGarners. However, the opposition /a/ - /a/ has 
bam included and /ij/ - /y:/ has been disregarded. The vowel oppositions 
which were selected are presented in iable 1, 

Table I. Vowel oppositions and the number of times they ero tested In 
this study 



Opposition tested 




/U/ - /I / 

/!/-/£/ 4 

/3/ - /^'/ 3 

/ A / ^ /.I / 4 

/A/ = /a / 3 

TuUl 18 



The Finnish and English cunhOHant btj^tmh differ from each other in 
at least two respects ^ (1) The number of tnglish consonant phonemes Ms 
nearly double the number of Finnish ones. (2) The English system has a 
distinction not existing in Finnish, ie, fortes-lanes, the number of 
fricatives is also clearly greater in English {11} than in Finnish (2) 
(Lehtonen, Sajavaara and May 1977; 128). The greatest number of learning 
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problems Seem to be centred round the LngHsh fricatives and fDrceS" 
lenes oppositions (Moisio and Vilento 1976: 49-53). Accordingly?, the con- 
sonants selected for this study are fricatives and foftes^liniS opposi- 
i:ions. Table 2 presents the distinctiDns studied. 

table 2. Consonant oppositions and the number of times they are tested 
In this Study 



Opposition tested 


TimsL tested 


/P / - /b / 


3 


/t / - /d / 


4 


/k / - /g / 


4 


/tj/ - /dV 


3 


/tr/ ' /<ir/ 


2 


/s / - /e / 


2 


// / - /t(/ 


2 


/■I/ - /dz/ 


1 


Total 


21 



kinttiiMi is not c3 discrete concept, as vowels and consonants are, 
because it is connected with a larger part of an utterance, A common 
definition f-^r juncture is that it is a "non-phonemi c modification of 
sounds at certain gr.™atical boundaries" (Lehist© i960: 9, Hoard 1966: 
106) i ie, the gram^tiatical boundaries of language structures can modify 
sounds; if modifiable sounds happen to occur at such boundaries, junctures 
are present and observable, but if no such sounds are present, junctures 
are not phonetically perceptible. Thus junctures are bundles of identifi= 
cation cues in phonetic surface representation and the cues are condi- 
tioned by higher level language structures. 

In some cases, the segnientatiQn of the same sequence of phonemes 
into units in two different ways changes the meaning of the sequence, 
which means that juncture is responsible - at least on an abstract level 
of phonology - for tha phonological distinction in question. This dis- 
tinction need not, however, be phonetically present.. Ihe juncture material 
consists of 13 pairs of sentences. 

The function of the Mnti^ntQ ^fAf.H is to make certain parts of an 
utterance prominent and, i"n this way, to influence the semantic content 
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AcLurding tO Luhton^^ru Siijavaara and May ()977: 139), the difficulties 
incounterfed by Finns in learning the English affricates are similar to 
these of the plosives^ both members are difficult. From the point of view 
of a Finnish learner^ the situation derives frofr, a physical difference 
between the TL and the NL (Wiik 1965: 17) because no affricates occur in 
Finnish. The most insportant phonetic cues given by Gimson (1956: 1 73) are 
"the transition between the preceding vowel and the- stop'' together with 
"the explosive onset of friction". According to Strevens (1960: 41). thet^e 
is '"a brief intervenirig ( s] or ^ z ) -like friction before[J^J and t^j] ele- 
ments". In other words, the cues are based on various kind^ of noise 
bursts and friLLions. Ihis situation serves as a potential source of dif- 
ficulty because "a Firin may have a tendency to underdiscriminate the 
various kinds of noise bursts which appear ifter the release of the initial 
alveolar n^Qsive in English" (Lehtonen, Sajavaara and May 1977: 139), 

cVv-' ^itirii :tf: - - The means* standard deviations and mean 

percentages for the identification of /tj/ and presented in 

fable 11. The mean of the lenis member /dj/ is clearly higher than that 
of thfe fufti-^ {Memher /tJ/ (20.7 and 5,3, resp.). The difference between 
the muans is 'Statistically very significant, fhus the contrastive expec- 
tation is not Supported. This does not, however, mean that in a purely 
linguistic sense the lenis member would be as difficult as the statistics 
show. In all the three tests the lenis member was favoured by strong 
prdgmatic cues whereas the fortis member did not have this kind of ad- 
vantage. The discrimination percentages were quite low. 56 % (eKcept in 

c:;<K/q/i?, R8 :). This may be connected with the ^Mnherent" difficulty for 
Finnish speakers of perceiving the distinction between /tJ/ and /dy , For 
example, Moisio and Valento (1976: 74-75) found this opDOsition to be one 
of the most difficfilt among the bnglish consonant oppositions for Finnish 
learners to identify (the i den ti flea ■ n test, however , differed from the 
one used in this study). In two cas^^ the discrimination test percentage 
was 56 and in one case SB . (mean 66,7 y) . This speaks in favour of the 
"inherent" difficulty of this opposition. The high percentage is explained 
by the '^elear" Idiolect of the sample in question. 

Table 11, The identification of the allophones of the opposition /t|/ - 

/d3/> _^ _____ 

Consonant Times Mean Stdev " T-value Significance 
tested tested . _ „ 

/tJ/ 3 5,3 2,08Z 21,3 

/dj/ 3 20.7 2.517 82.7 "'5?2 * * ^ 



OCiiet criiAtMuinf oi'pubitiun^. - OppoiitLvn itx! = /dx/ . -The potential 
difficulties of /tr/ = /dr/ are In the fortls - lenis character of this 
opposition (ii. the distinction between /d/ and /t/ and In the release of 
the initial stop). In addition, the linking of /t/ and /d/ with /r/ may 
cause perceptual problims due to the different shades of friction, Ihe 
problems are similar to those of /t)/ - /dy. 

Far-reaching conclusions about the difficulty of the /tr/ - /dr/ 
opposition are not possible. It seems that this opposition Is problematic 
for Finns. The discrimination percentages are low (16 7. and 48 and the 
Identification is mainly based on pragmatic cues. An interesting detail 
is also the Finnish learners' problems in perceiving the difference 
between /tj/ and /tr/. On the whole the results support Moisio and Valen- 
to's (1976; 74-75) findings about the difficulty of the /tr/ - /dr/ op- 
position, although the lowest discrimination percentage (15 %) st ms to 
be pahtly caused by the "mashing" effect of linking. 

Oppu^ULon !hl - /Qi\ - The opposition /s/ - /S/ Is different from 
the oppositions dealt with above in that the distinction between its 
members 1s not that of fortls - lenis. Instead, the minimal difference 
Is maintained by the place of articulation, /s/ being alveolar and /B/ 
dental (see Lehtonen, Sajavaara and May 1977: 143-144). Both members 
belong to the fortls series = A comparison of the Finnish and English 
fricatives shows that in English five places of articulation (labiodental, 
dental, alveolar, palato-al veol ar and glottal) are distinguished whereas 
the number of the places distinctive In Finnish Is only two' (dental and 
glottal). Therefore speakers of English must make subtler distinctions 
between various kind of frictlonal noises than speakers of Finnish. Leh- 
tonen, Sajavaara and May (1977: 145) point out that "a Finn has to learn 
to hear the difference In quality between the 'sharp s' and' the 'hushing 
s' as well as between the tense (voiceless) and lax (voiced) fricative", 
According to Gimson (1966i 180), one of the cues for fricatives Is the 
extent and position of the noise coniponent. Ihe continuous noise ranges 
from about 3600 to 8000 cps for /s/ and from about 1400 to 8000 cpS for 
/8/ (see also Strevens 1950: 4U Hughes and Halle 1956: 306-307), The 
intensity of /s/ Is also greater than that of /y/ (Gimson 1966: 180). 
Perhaps the most Important cue is, however, in the nature of the adjacent 

^ The position of /f/ is marginal in the Finnish phonological systim 
because it occurs only in some loan words, often pronounced as /v/. 
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vocalic glide {Gimson \^bb: im) . As a whole both /s/ and /g/ nre liable 
to cause discrimination and iden ti f ica tion problems. The opposition /s/ - 
/9/ seems to be difficult both to identify (niean fib ) and to d1scr1mi = 
Mate [man 66 ). iipwever, most of the problt^iis are caused by the prag- 
fiiatic cues* perhaps because Mnguistic cum'^ are = for one reason or 
another - very weak, 

OijprbiCL.:H = fp. = The distinction between the palato-alveolar 
/jV and /tj/ is that /]"/ is a fricative and /t\/ an affricate. The status 
and cues of the affricates were discussed above. According to Glmson 
(1956: leO), the acoustic cues of /j/ are the extent and position of the 
noise component, a continuous noise in the spectrum ranging from about 
2000 to 7000 CPS, Sirtiilarly to /s/, /// has a relatively high intensity, 
"innish has no corresponding sound, except in certain loan words, but 
even there its realization is not usua 1 ly ijl but rather [s] , for example 
in the word takki {'chess'), fhe area of the variation of the Finnish /$/ 
IS quite extensive. This is implied by Mazzarella (1971: ^1) when he 
states that "Finnish speakers pronouncing English make the hiss of English 
/s/ too sharp so that an Englishman may hear U as The difference 

OtjLween Finnish and Lngli&h is, in Wiik's (1965: i /) terms, physical both 
in /tJ/ and in /j/ . The fact that in Swedish the sounds CJ] and [^11 
may De In freti variation may be confusing because the informants have 
studied Swedish for ihrQo years. In brief, both /tf/ and /// are liable 
to cause identification and discrimination problems. In Moisio and Va- 
lento's ( 1976: 7'U7b; study, this pair was found to be fairly easy. It 
rny be easy if certain canditions prevail but difficult under different 
conditions, for example when connected with some other sounds or when 
some pragmatic cues are present, A reflection of this is given by the fact 
that the discrimination scores ranged from 92 , to 40 and the identifi- 
cation ones from 9Z to 40 

OppubLtiini /tp ^ - ihe members of the opposition /tJ/ - /dz/ 

differ from each other in that /if/ is a voiceless p.ffricaLe and /dz/ a 
voiced consonant cluster. The phonemic status and the identification cues 
of /tp were discussed above. The distribution of /di/ is mainly limited 
to the word-final position (Gimson 1966: 171), This is one reason for the 
cluster interpretation of /dz/. Since the documentation of the acoustic 
cues of /dz/ is scanty, both /d/ and /z/ will be dealt with separately. 
For the cues of /t/, see above, /z/ can be defined as a voiced alveolar 
fricative. The continuous noise in its spectrur^ ranges from 3500 to BOOO 
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CPS (Gimson 1966: ]tiQ), ie. the noiie has a relatively high frequency. 
The duration of /z/ is Tong and its intensity low. Thus the difference 
between /tj/ and /dz/ may be both in voice and in friction. However, the 
ffloiE important cue in the word-final position is the la'ij;n of the pre = 
ceding segments, while a reduction takes place in front of the fortis 
member of the opposition. The single /t// - /dz/ opposition included in 
this study was discriminated by only 36 . of the informants probably 
because none of the informants were able to use the length cue mentioned 
above. The identification took place mainly by means of pragmatic cues 
as the quite high percentages, 64 ^ and 80 -o, show. 

Co Hi onant f ppo 6 Ltio iUt ga tidAtii notkiM. - F i g u re 2 i 1 1 u s t r a te s the 
identification and discrimination Of the consonant oppositions included 
In this study. 
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Figure 2. The discrimination and identification of the consonant oppositions; 

the mgan percentages of correct identification and discrimination 
(y^axis) for the members of each opposition (x-axis). Each op^ 
position IS presented as a group of three columns; the one with 
oblique lines stands for the mean discrimination percentage and 



the blank ones gi 
opposi tion member 



e the mean identification percentagiS for both 



The mean percentages of both discrimination and identification in each 
Opposition correspond to each other quite well. The oppositions that con- 
tain only one or two items tested show greater internal variation. There- 
fore it seems that when the number of items is incriased, the solution 
(mean) percentages become more uniform, ie. the effect of various fluc- 
tuations is neutralized. In some pairs both members are difficult for 
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Finnish learners, which tends to make solution percentages more unifonru 

The most difficult opposition included in this study was /tr/ - 
/dr/ , which supports the Idea of its being i special case of the ill = 
16/ opposition. A striking phenorrienon was the low identification per= 
ctnta^jtj of / 1 j / , which can largely be expliined by prigrriatlL criteria* 
The same trend is also seen in /tj/ - /da/, where the discrimination 
percentage is very low and the identification took place by means of 
other than linguistic cues. 

In general it is difficult to say which oppositions cause the 
greatest difficulties for Finnish learners because the context (both 
linguistic and pragmatic) greatly affects the degree Of difficulty in 
each case. For example * the falling of the sentence stress may compMcate 
or facilitate the processes of discrimination and/or identification. The 
Fnglish consonants are not easy or difficult in themselves: difficulty 
or ease is a function of the specific conditions in which the consonants 
occur, 

Juiic turn' o\:\^ohitiv\i^. - Jiuic tu\vb in ¥l\\)iLhh ind Lit fugCo/j. - The 
review of the concept of juncture revealed that the existence of various 
boundaries may be determined by gramfnati ca I criteria* For the comparison 
of Finnish and English Junctures the function of the linguistic unit 
called word becomes essentiah According to Karlsson (1977: 59), the word 
carries more Information in synthetic languages (like in Finnish) than in 
analytic languages (like in English)^ Accordingly, the word in Finnish is 
a ffiore injportant unit both semantical iy, grammatically and perceptually 
than the v/o rd in English, This is reflected, for example, in the d 1 f f i ^ 
culty of defining the concept word in tnglish. The Importance of the word 
means that there is a greater need In Finnish for word-internal cohesion 
and also a greater need for boundary markers in Finnish than there is in 
English (for more details, see Karlsson 1977: 59-63), 

Certain phonotactlc rules Signal word boundaries in both languages. 
In Finnish, no consonant clusters are permitted in the initial position 
of a word (except In loan words). The clusters /sr/ and /Ir/ do not occur 
in word-medial positions whereas /ps/ and /pi/ are found in these posi- 
tions only (Wllk 1973: 47). Simple velars and labials are not permitted 
in word-final positions (Karlsson 1977: 65), Consonant clusters are 
negative boundary markers indicating that the word continues (see Tru- 
betzkoy 1958: 277). 



In Finnish, vowel harmony requires that tht front vowels /y, o, aV 
cannot coexist in the same word with the back vowels /u, o, a/ {see eg. 
yiik 1973: 47; Irubetzkoy 1958: 284-285; Lehtonen, Sajavdira and May 
1977; 152). For exampii* the String Of sounds /ybjuoksu/ IS pirceived as 
two words. Because the suffixial vowels of a Finnish word ars deterrnined 
by the vowels of the root, vowel hafinony also contributes to word-interna 1 
cohesion (Karlsson 1977: 65). In addition^ the morphophonemic alternations 
create redyndancy faecause, for example* the alternations in the suffixes 
can be prgdicted by the alternations of the root. The redundancy on its 
part facilitates decoding (see Karlsson 1977: 68), 

No phenomenon equivalent to Finnish vowel harmony exists in English, 
and certain othir phonotactic rules signal word boundaries* /j/ does not 
occur in word-initial positions and therefore no bound^iry is possible 
before 1^1 . /r* js w/ can occur in clusters as non-initial elements. In 
word- final positions only /1/ may occur before non-syllabic /m^ n/ {for 
these and other examples* see eg, Gimson 1966: ^39^256), 

The phonotactic rules set up the possibilities of the occurrence of 
word boundaries, but, in the actual manifestation of the word level 
junctures* various phonetic cues are present. In Finnish words, the pri- 
mary stress is on the first syllable and it acts as a positive boundary 
marker indicating where the word begins (irubetzkoy 19BB; 277). In English, 
the main stress of a word is not necessarily on the first syllable (Gimson 
1966: 222). Therefore the English word stress is not such a uniform bounds 

ary signal as the Finnish' one- 

2, -, 

According to Lehiste" (1964; 1/4), in a Finnish vowei + vgwei sequence 
the occurrence or non-occurrence of a word boundary between the two vowels 
is signalled by the quality of the second vowel; a reduced vowel signals 
the absence of a juncture and a non=reduced the presence of one. In English, 
the quality of vowels may also serve as boundary signals in some cases* for 
examplo the pair I %c.\^op\ - rce^citHt^?! is differentiated* among other things, 
by the long and reduced /ai/ element (Gimson 1966: 375). 

^ In the phonetic transcription Finnish letters have been used since they 
refer to the corresponding phonemes in one-to-one relationship. 

^ Lehistfi compared some Finnish compounds whose first part ends in a vowel 
and whose second part begins with a vowel with similar non^compound words, 
eg, /lintuansa/ (object form of 'his bird') ~ /llntu-ansa/ ('bird trap'). 
The differences between the manifestations of the V + V sequence were in- 
terprettd as signals of the presence or ibsence of a word boundary. 
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Ihy dui^dtiun yf iht* ^lerond component in a Finnish V+. sequence is 
longer if the word boundary fails between the vowels than if there is no 
boundary iLehiste 1964: l/S). A similar tendency is seen in Lnglish, where 
Lehiste (1960: 42) found initial allophones to be considerably longer than 
tne fnedidl or post-Junctura 1 ones. For e;^ample, the inititil plosive / 1/ 
was about five times as long as the medijl /tj. Moreover, nazal .ation 
can De a Juncture cue in Finnish (Lehiste 1964: 77). 

Su, Daniloff and Hanunarberg (197b: Zb4) tried to find out whether 
nasal coarticulation can mark the presence of a Juncture in English. 
Hfjwever, low level Junctures were marked with little pause lack of 
coarticulation] and only clause boundaries showec consistent pauses. 

The use of glottal ization or a glottal stop as a juncture cue has 
been reported 1n both Finnish (eg, Lehiste I96at 175) and in English 
(eg. Lehiste lygO: 45). According to Lehtonen and Koponen (1^77: 76-79), 
the distribution and fynction of the glottal stop is, however, different 
in these languages. In tnglish, g lottal ization may occur in word-initial 
and word-mediai positions in front of a syllable beginning with a vowel. 
In Finnish, it is possible only at a word boundary. Lehtonen and Koponen 

/J) say ^hat "tne essential differtince in the occurrence of 
glottalization in English and in Finnish is, thus, the fjct that English 
glottalization is a phonetic means connected with syllable boundaries 
and emphasis, and not an ynambiguous marker at word boundaries. In Finnish, 
glottalization can never be used v/ord-medial ly to mark a syllable boundary 
beginning wi th a vowel . " 

Because glottalization _in occur only between words in Finnish, 
Finnish speakers may also regard it as a word boundary cue in English, 
this is supported by the study by Lehtonen and Koponen (1977: 75, 84). In 
front of words beginning with a vowel^, Finns used glottalization more 
frequently than it was used in standard English pronynciation. Glottaliza- 
tion was also often used instead of linking (Lehtonen and Koponen iy77; 
7fai. 

The aspiration of the English fortis plosives in the initial position 
and the lack of aspiration in other positions may serve as a strong cue 
for English speakers (Christie 1974: 819; Lehiste 1960: 42-^43). For 
example, In the pair fcr^p ^ticting - topi UckinQ the /t/ of the first 

' In Finnish, glottalization is used above all as a boundary signal in 
connection with words beginning with a vowel. 
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member is unaspi rated and that ot the second member is aspirated. No 
corrispondlng phenomenon can he found consistently in Finnish. 

The comparison of Juncture cues in English and Finnish shows that 
their correspondence is not very great. It seems that Finnish speakers 
use English cues Only acci-jentariy* for example, when a glottal stop 
happens to occur at a word boundary or when the primary stress happens 
to fall on the first syllablg of a word. The profalem is also compiicated 
by the fact that the rialization of phonetic cues is often speaker-depen- 
dent (see, eg,, Kent at 1974; 470; Moll and Daniloff 1968: 680) to 
such an extent that they may becOfne neutralized in connected speech (Gimson 
1966: 301), This gives reason to suppose that pragmatic cues can be quite 
strong in the perception of juncture oppositions. 

IdQ^ntLii^aClon juncMiAe. opposLtions , - The identification and 
discrimination percentages (y-aKis) of the juncture oppositions (x-axis) 
are presented in Figure S below in the form of a diagram. The figure shows 
that the identification of the juncture oppositions is inconsistent both 
between and within the oppositions (solving percentages range from 4 % to 
100 *), The same applies to the discrimination percentages (from 20 % to 
96 ^j). In general, the linguistic cues that Finnish informants use in 
perceiving juncture oppositions are the occurrence of the word stress on 
the first syllable of an English word or a period of glottalization pre- 
ceding a word beginning with a vowel- The highest percentage is elicited 
by a glottal stop in front of a vowel in the initial position of a word, 
ie. in the case of the pair That p^xlnct S/pUncei5 4s oiLt ^'tlmd. Some 
other similar pairs [tha ms^tt - them cat; gtey tie - g4eat ei/e; a tawcx 
liotat* - a-t oLiA /louiej a time. - m a</ji; ^te^ Pamuj - ^4ecd Aiuilt) which 
did not have any glottal stop or had a very weak glottarization in this 
position were clearly more poorly discriminated than pitncc S - pninccA^. 
So it seems that the glottal stop for Finns must be "clearly" perceptible. 

Many other cues found to be responsible for the phonetic manifesta- 
tion of Junctures in sofne previous studies (eg, Lehiste I960: 19* SO, 3S; 
Hoard 1966: 98-103; Gimson 1966: 300) » such as the quality and length of 
segmental sounds, provided they have been present in the samples of this 
study, have not been prominent enough for Finnish Infortnants to use. This 
becomei understandable if we know that nia ny of the durational and quality 
cues were found to be inconsistent and even native speakers found it 
difficult to perceive distinctions like these (see eg. Hoard 1966: 101). 
According to Gimson (1966: 300), "the function of these cues dipends on 
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Figure 5- The di scrimi niti on and identification of the juncturi oppositions-, the percentagis of correct identi- 
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the speaker's consciDusness of the word as an independent entity." In 
other words, various linguistic cues ire products of the particular 1dio= 
lict in question, Ihe realization of the "theoretical" cues obviously 
differs both interpersonariy and within every speaker's pronunc ia tion. 
Some good examples of the pairs containing weak cues 1n this study are 
tuUph - £iv Cip^, mij to - T and nitrate - tUgiit-xata. In these 

cases the linguistic cues ha'., been srnall and difficult to detect and the 
identification has oeen based on pragmatic cues such as sentence meaning. 
In fact, the number of pragmatic cues 1s quite great for many pairs. This 
might mean that the abstracting of word boundaries Is really a great 
problem for Finns learning English. 

S.jntetw-' ittuii oppa^ittijiL^. ^ Sentence stress has two functions in 
English (see eg. Lehtonen, Sajavaara and May 197/: 6/=69- Lehiste 1969: 
150^151: Bolinger 1961; 83; Malmberg 1968: ill). Firstly, every sentence 
has a normal non-emphatic and unmarked sentence stress, which is pridic- 
table by means of the syntactic structure of the sentence, Seconoty, if 
the purpose is to give more ©riphasls to a word or a morpheme, contra$,tive 
of enDhattc stress is applied. For instance, in the exaniple given by 
Lehtonen, Sajavaara and f!ay ( 1977- 68) Vchtcndau J boa^kt a lawn'^ biCifcCa, 
the semantical ly and logically most important words are stressed, ie. the 
words u^^toAdaif and bicifcCi!.. But If this unmarked stress pattern ls changed, 
for example, by giving the word buugiit a special emphasis, the underlying 
structure and thus the meaning of the sentence changes (about the surface 
and deep structure interpretation of contrastlve stress see eg. Bresnan 
1971: 269 and 1972; 326^327; Chomsky and Halle 1968: 15= Berman and Szamosi 
1972: 319). 

In Finnish sentence stress can express roughly similar phenomena as 
In English. According to Wiik (1977: 45=46} , it can signal the constituent 
structure of an utterance, the thems-rheme relationship or contrast (see 
also Nurmela 1934: 180; Lehtonen, bajayaara and May 1977: 6Bj . 

The phonetic cues responsible in English for the realizatton of stress 
are quite unanimously and amply documented; the pitch of the voice, the 
quality of vowels, the duration of segments and the intensity of voice 
(see eg. Lehtonen, Sajavaara and May 1977: 69; Lehiste 1969: 131" Morton 
and Jassem 1365: 178; Fry 1965: 151; Lieberman i960: 451; Brown and 
McCione 1974- 971; Adams and Munro 1978: 153), According to Lehtonen, 
Sajavaara and May (1977: 68), the mos'; effective cue both in Finnish 'and 
English is the change in the pitch of voice, although the pitch variations 
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in Finnish are niuch smaller than in Lngliih. ThuSs Finnish learners of 
English are not accustonied to such extensive pitch variations as eAiSt 
in tnglish. Tne placement of stress ls, hov/yver^ largely deternijned by 
tNG neLining to be conveyed (eg, Ginson i966: ZiS) and, accordingly, by 
Ine context and lituation where the particular sentence occurs. There- 
fore pragmatic cues may have a central role in the discrimination and 
identification of sentence stress oppositions. 

IfUi ii( bcnuyii nation aiij idi'ntiiicaticii l-^ tiCntdicc btii'hb, ^ The 
discrimination and identification percentages (y==axis} of each sentence 
stress opposition x=axis) are presented in Figure 6, It shows that the 
problems related to sentence stress are of two sorts. Firstlyj all the 
iriformants have not been able to perceive sentence stress distinctions. 
With two exceptions, the discrimination percentages fall between b4 , 
and 76 . Ihe fact that the solving percentages of the oppositions are 
quite evenly distributed makes the difficulty consistent* ihe two de= 
V i a t i n r] discrimination percentages ( i e , ^3 for ' Uh i Cv tUjui c fick i tc 
'fU_-'jh>' and 96 for 'vihitiu^ ^ <\jmu's .\' < i ( t uih '-^Ki-'Ui/^) seem to be re = 
h'iCeU to tny iduilyct of the test samples. 

The second probleni is related to the function of the sentence stress ^ 
i e = t he learner must u n d 0 r s Land the y n ,::e r 1 y i ng structure maintain ed by 
the sentence stress. One probable explanation for that is that the 
learners have not alv/ays known on v^hich part of the utterance the sen^ 
tonce stress should fall to give a particular shade of meaning. Often 
thli has to do with the understanding of graniina t i ca I structures. The 
situation is further complicated by a quantity of pragmatic cues directing 
the identification process. 

The effect of all these factors is reflected in the high deviations 
among the identification scores (ranging from 8 to SB .>) . (wo identic 
ficatioh scores even exceed the corresponding discrimination result, Ihe 
results show that Finnish comprehensive school pupils may have difficultie 
of the same degree with sentence stress as with segmental sounds or junc= 
tures even though in principle the functions and the phonetic cues of 
sentence stress both in Finnish and English are to a great extent similar. 

D(Ji<ficiLttiiJ^b tviatiHi tu tlw tdQntiJiLcation and diAc/LimlHation o?^ 
tiw uppobittan^h. - The difficulties related to the identification and 
discrimination of the oppositions tested are caused by a number of factors 
in each particular case. The factors are quite heterogeneous* because 
th^y vary greatly from context to context. 
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Figure 4, The discrimination and identification of the sentence stress 
oppositions-^ the pircentages of correct Identification and 
discrimination for the mimbers of each sentence stress opposition. 
Every oppQSition Is presented by means of three colunins; the one 
with oblique lines D) stands for the discrimination percentage 
and the blank ones stand for the identification percentages* 




The difficulty uf tho oppositions does not vary according to lin- 
guistic categoriBS {ie. voweU. consonants, junctures and sentence 
stress), [able M beloy/ shov/§ the mein scores^ mean percentages ana 
standard deviations for the discrimination and identification of the 
it&ms belonging to these categories separately and for the identifica- 
tion and discrimination skills in generah 

Table 12. The means, (x), mean percentages (x ), and standard devietions 
(d) of the discrimination and identification of the vowels^ con 
sonantg, junctures and sentence stress 
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The figures indictjte that both in the case of discrimination and of 
idsntif icatlon the differences in difficulty between the categories are 
very sinali indeed; in the d i scr imi ria 1 1 on the greatest variation, which 
is between the sentence stress (mean 6S.b j and the consonants (meari 
:iJ.4 IS ofiiy 3 , in the identification the greatest variation, about 
hi , Is found between the consonants (mean b6..U ) and the vowels/sen- 
tence ^-.^.ress (mean for both 4/. 3 ). hccordingly, the division of the 
oppo^. :ions into suprasegmental s {- juncture + sentence stress) and seg- 
men* ;;s vov/els + consonants) does not imply any clear difference in 
difficulty. A lot of inconsistency is also found 1n the identification 
and discriniination of the various si'hciasses of the vowels and consonants 
(eg. /A/ = /a/ and /p/ ^ /b/). 

Because the difficulties are not distributed according to the lin- 
guistic categories mentioned above* it is to be concludsd that the oppo- 
sitions occurring in sentence pairs are not necessarily easy or diffi- 
cult as such; instead, the difficulty is determined by the positions 
they occupy in an utterance or expression. Therefore the Independence of 
the sentince pair oppositions is more limited than that Of the word pair 
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oppositions, for exdinple* This Is reflected for instance 1n the fact that 
contrastive predictions fall short in many cases in predicting the per- 
ceptual problems of the sentence pairs; the power of contrastive predictions 
varies because they are supposed to forecast the possible difficulties 
between phonological ly distinctive units in carefully controlled circum- 
stances, 1e, extracted from the context in which they are found In utter- 
ances . 

The importance of an ppposition can be evaluated only if we know all 
the linguistic and non-linguistic cues related to its perception. In each 
case a bundle of cues is responsible for a particular identification or 
discrimination result. The bundle of cues differs griatly from one situation 
to another. This is clearly seen when the discrimination and identification 
of the sentence pairs are compared. Though discrimination, in qeneraU 
seems to be an easier process than identification (mean solving percentages 
G4,4 /, and 5M respectively), the variation within identification and 
discrimination and also between them is great. One and the same opposition 
may obtain clearly different identification and discrimination percentages 
in various positions. The identification score may 1n some cases be even 
higher than the discrimination score (eg. Turf - dead). A common phenomenon 
is that one member of an opposition is so much better identified than the 
other that its percentage exceeds the discrimination score of the respective 
pair. This seems to be especially the case when the discrimination score^ 
for one reason or another , has remained low. This means that when the in- 
formants cannot perceive the linguistic cues properly they rely on other 
cues. This may sometimes faciiitare the identification of one member but 
at the same time make the identification of its opposite member difficult. 
This would mean that the role of the oppositions pivt for the understand = 
ing of utterances would decrease ana the importance of the other cues would 
Increase. The fact that the informants regarded the identification test 
as easier than the discrimination test may indicate that the informants 
have often used sentence cues and disregarded those belonging to phonology 
because they might have been difficult to detect. 

The reasons for the utterance dependence of the difficulties caused 
by the various oppositions are to be found in the redundancy present in 
speech. The native speaker needs to hear only a part of an utterance to 
understand the message. He can make predictions about the coming speech 
and fill in the possible gaps in the code (see eg. Tomnola 1977: 25; 
Lehtonen, Sajavaara and May 1977^ 18-19). The situation of the native 
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speakir is different from tnat of a foreign langyagi learner, becausf 
thi FL lo . nannot always use redundancy to a slmnar iJ<tent. He m%t 
perceive a nu;;;ber of Cues 1n order to understand the messagf ; If he can- 
not hear as many cues as are necessary for understinding , ho must fill 
In the gaps with some sort of meaningful cues. He may sometimes be able 
to hear the purely linguistic cues but sometimes he must rely on pragma- 
tics. Therefore the fundamental difficulty in the perception of the pho- 
nological distinctions occurring in sentences is in the interpretation 
cf all the cues that are available in the situation. It is impossible to 
predict exhaustively what cues will be present^ sometimes they are lin- 
guistic, sometimes not, sometimes the linguistic cues are important, in 
Other cases not, etc, 

Ati ijxjuniiiation t/ic pm§matic ctici . - The important part that the 
pragmatic cues play as independent variables explaining the discrimina- 
tion and identification test results makes it necessary to examine these 
cues in more detail. Pragmatics = though its definition may vary (see 
eg. Gn idijk and Stokhof I97B: 51) - regarded as a part of coimiunica- 
tive corripetence, the other part of which is grammatical competence, ie, 
the Interrelationship between form and meaning (see eg. Wales and Campben 
197/: Z49; Hyrnes 1971: 282; Corder 1973: 92; Sajavaara 1977- 21; Saja- 
vaara and Lehtonen 19/7: 14; Kohonen and Saleva 1978: 13). According to 
Sajavaara (197/; 14), prv; atics Is a set of nJes for the mutual inter- 
action of people in speech situations. Sajavaara (1977: 21-22) gives a 
list of the possible parametres which may play a role in a communicati ve 
act: "speaker, hearer, time and place, code (broken down to various com- 
ponents), channel, various prerequisites of the speaker-hearer (knowledge 
of the world, knowledge of the other parties of the speech event, social 
relationships and roles including various rules of politeness and hier- 
archies, norms, understanding of earlier messages, most of which is 
normally covered by the concipt of presupposition), intentions of the 
speaker and hearer, affective states, non-verbal elements of the com- 
municative act and problem solving capacity". Ihe scope of the present 
study does not, however, include any interactional speech events and 
therefore the subjects act only as hearers and cannot use potential 
situational cues proper. The pragmatics of the study mainly covers dif- 
ferent factors which may intervene in the identification and discrimina- 
tion of the phonological oppositions, other than the linguistic factors 
proper. The bulk of the pragmatic cues are related to the impressions 
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given by the mianing, syntax, ^ords and social factors of Ins tsst 5en= 
tencii in the discriniinatlon and identification ''situations*'. The ten^n 
'linguistic cue' has been risirved for the cues tha* are deduclfale f^m 
the phonological iystefn and are in mony cases explained by iTngulstic 
contrastive meani. 

In the following list, the pragmatic cues are piven In the order in 
which thty will be dealt with belowf 

K The mastering of the words containing the structuj'e? tested, 
2. The probability of meeting the words containing the structures 



3» The probability of nieeting the test sentences. 

4» The leigtn of the cest sentences. 

5, The difficulties of analysing the test serjtences, 

6, The infonnatlonal value of the test sentences. 

7, The referential appropr lateness of the test sen^^.enc^^j. 

8, The concrete vs. abstract reference of the test htntences, 

9, The general vs* situational truth reference of the test sentences* 
IQ, The attitudes and presuppositions towards the messages o:^ the te3L 

sentences . 

11. The social messages of the test sentences. 

12. The person-centred images in the test sentences. 

13. Orthographical confusions. 

14. Idiolectal variation, 

15* The confusion of English and Swedish. 

The mastering of the words containing the structures tested is the most 
coiwion pragmatic cue that affects the identification and perhaps also the 
discrimination. Poor mastering of the words makes it difficult for the 
informants to analyse the strings of sounds and to recognize the structures 
that contain the most essential linguistic cues. This is especially true 
of junctures because the recognition of words (and the key words in par- 
ticular) serves as a necessary prerequisite for the detection of the 
juncture eyes proper. Moreover, in the case of segmental phonemes it is 
easier to pick up identification cues 1f the respective word is well- 
mastered, The identification process may sometimes be based on the per- 
ception of i word more than on that of an individual phoneme. 

The elimination of the word effect Is impossible^ some worfis are 
always better mastered than others. The informants of this study had 
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studlid English for six ytirs ind the word effect is ciear. In at ieast 
25 Cfi^ues poor mastering of tht kty words has lowered the Identif icatfon 
perL'v-'i'agi of the rgspictivg Dppositio»i rriember. Ainong thi poorly niastergd 
woriv dre^ for example, /lewij tftiCj hut, baiHi tauM, ut^toi* o^'^xfi ;hI:c', 
iv.t'H and L^'L^l-cttii.' ? but It is not enough to catalogui these words because 
tni;-: difficulty is relative* the difficulty depends on the point of re- 
r-:;*"!'"!^^. The poor mastering of a v^ord in one member of an opposition may 
'•vceme especially emphasized if the corresponding word in its opposite 
Tpembei' 1s very wall mastered, thus the word effect is to a large extent 
or^ftii tion-speci f ic. 

The high profaabllity of coming across certain words increases their 
fafm 1 lari ty . fwo words niay be nearly equally well mastered but one or the 
other is more popular becaure of its familiarity. For example, in the 
pairs i^ivi- -iiwvk:^ ^lUp- slice : . baa-bact and ^^licndti-F tench the first mem- 
bers are better Identified partiy because their occurrence from the point 
of view of the infomants is more probable than that of the second mefn- 
ber, tf'.ougn both members are about equally well mastered. 

The main reason for the famiharity of the words is their frequent 
occurrence in school language, ie. In textbooks and classroom activity. 
In addition, these words are usual iy among the first English words that 
the pupils meet at school. Some examples of these words are: hitX, tu'c, 

Li*i, cat, bikCf bagt CL^it, hlwk , t hi iit and ^xiuiui. Again, the degree 
of famiMarity depends on whtcn ofhor v^ords they are compared with. When 
A word is very familiar, its phonological and orthographical structure is 
well mastered and the picking up of the linguistic cues felated to it 
becomes easier. But the word may also be too familiar, so that the in- 
formants tend to hear it even in cases where It is not present* 

The probability of the occurrence of the tesi: sentences can direct 
the discriminatioTi and especial ly the identification process^ certain 
sentences occur more often in contexts in which thfc infonnants are in- 
volved* A suitable context speeds up the recognition of the key words 
(see eg. Tulving and Gold 1966: 326). For example, the first members of 
the sentence pairs TlUh u ijl^ux ^C4i!i Aivi;^^ isn* t itt hUj pciU/f?inh an: nut 

and It Lb a iKWK'/cin ciim are more probable than the second members. All 
the sentences are formally acceptabje but the situations where they can 
be met differ considerably. Those having to do with school language are 
naturally favoured. But some expressions are more probable in the in- 




formant's everyday lives, too, Thiy are frequintly met in Finnish, 
which increases their fenii 1 iari ty , 

The iLngth of the test sentences also affects discrimination and 
identification. For example, Hughes and May (1974: 3?) state that memory 
effect is more clearly present in the discrimination between ni1nim5l 
sentence pairs than between mi nitnal word pairs. AccorL*ing to Fry (1970* 
4rO* short term memory capacity limits the Infonnation which people 
can handle in a given time. This may be true for d1 scriminatlon * but 
not so much for Identification. In the Identification process the 
rneaning hai a central role, and people can remembir mfcanlngful sentences 
as against unstructured strings (see eg. Slobin 1971; 21"^ Jauhiainen 
1974: 13), As a whole the identlff cation process seems to be based more 
on the meaning than on the discrimination process, Howeveri if a sentence 
is ver=y long (eg. Tha zoo' ^ bxzkftahh £to a fjcpufat talking p0<nt. U'/icji 
Z tCfU i4.-t£<ri0 in the, boat I htCLxtad to ^iiik/ think, or It <i the biggeht 
tcA-pL't/£Caf.\'tC in the it-oxld] the picking up of the linguistic cues 
beconies difficult. To see what the influence sentence length was, the 
test material was divided into Short and long sentisnces (five words or 
less vs, more than five words). The mean scores and percentages for 
both discrimination and identification are given in Table 14. 

Table 14. The identification and discriminarion of short and long 
sentences . 

Sentence Di scrimi nation Identification Total 
length - ~ — ^ — ^ — — - 



short 
long 



f 


X 


xl 


f 


X Xb 


X 


x% 


61 


17.3 


69,4 


122 


14.0 55.9 


15.7 


62,6 


59 


14.4 


57.8 


118 


12,0 48.1 


13.2 


52.8 



The short sentences are identified (mean 55,91) and discriminated 
(mean 69.41) sofnewhat better than the long sentences (the respective 
pafcentages 57.8% and 48.1^), The diffirence between the short and the 
long sentences is slightly greater in the discrimination test about 121) 
than in the identification test (BS), In general, the short sentences 
(62*6%) are better identified and discrimTnated than the long sentences 
(52,8%)* The difficulties in the analysis of the structures of the long 
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sentences is a pTobabIt ,ixp]anati dji for the results. Ha^'/§vfr, the 
differences betwttn sh^rt ird long ioMtencfes are sfnEll because niiry 
inter-vemng factors blu*- the pfctury. For exarnDle, the ncQurrence 
Of itresses on ths "cr-ltfcaT s..Qrds" jnrf some idiolectal f^hlurm dtitort 
the result. 

Thf problems in the aralysis arid recognitlDn of the 5tructur»3 of 
the aeritences can raisi idinti f ication^ and distri^ninatlon problenvs^ 
This is espscially evldefit in juncture cppositlDns UN T Knciv tlm 'Xa/ 
tc cut it 1 irfv.':v rne i':a;Ltcn cuf Lt, 1 uil - f/u: nwat > I thcjn 
or Thcif ^-isc Uannif ' TUcif ^tcCff Anhu . The Eitructural felationshlp 
between tno rnembers is complex in tht sense thai: no onf^-to=Dn& forml 
correspondence ixists beueen then. Informants must be able no see 
what possibilities thgre are of dividing the segTievitany similar sentences 
into two formally ^nd semantically different un.iti. The sentence stress 
oppositions relate to underlying structural riiiff erences . For e>a^,iD]e, 

(farjott'!.iS ■ :irer iiiust J^tallze that the sentence structures are 

In other words, the sentences ccnsi sting of rhe san:e iihonerre string 
can have at l^ast those two underlying stryctures. The problem is 
especUlly acute with such pairs as tUv,- rcu the u'/ufc '/it-u^c- / 

The fcrvc 'cahc. .t s/vctar .^^v, where it is c^ifftcuit to know the 
function of the sentence stress without earlier experience (ie. without 
knowing. Tor exanple, that the stress on Mtc vmpUes 'The ^<hite House 
in Washington' and the stress on Iwiibc refers to 'a house painted white'. 

The fact that certain forms are preferred influences the identi- 
fication. For exainple, 1n the pair Jhctj a-.c 'viaijui^ caxd^ ! ihuj aie 
^lat^inq 'cand^ the inforrnarits tend to 'hear' the latter member more 
readily because they associate these sentervtes with the present progressive. 
This is due to the fact thirt the present progressive is rntroduced at the 
beginning of the English studle? and the pupils have a strong tendency 
to use it even in cases; where it is not appropriate (see eg. Kohonen and 
Saleva mSi 14), 

The Informational value of the test sentemces can be linked up with 
the identification and discrimination problems. The Infonnational value 
is like the markedness tJheofy in itnguistics (see p. 179 above) i 
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he message carried a sentence Is ''UfWiarked" If it does not giva any 
JSuii, new Infoniiatfoin but "rnarked'' if sofr^athing new md perhaps une)i- 
:ted Is pressnte^i. In other words, for examplej driving a car Is not a 

of niwi hut driving a car in such a way that an accident tikes plice 
new and infonnati ve. In the pair I pickzd It tip and it tcaCt'i/ u'lii ccCd 
id th3 latter m&mhBr more 'populaf ' because it is nore uneMpected and 
.nuiual to pick up iol den things than to come across something cold. The 
jyite high percentages of the third alcernative It <i a s/uh<"ci hijuic in 
the case of It u'Ai at cu.i ' Iwiiho. I It taii a tcwi hciLhc is obviously 
related to the interest In sofnething unusual * something that has nots 
perhaps, even been met before. More or less similar is the popularity 
of ^rx in th" pair t^avc ijou bcm tk^ ^ciki f ^cx thexcf In principle, 
however s the informational value of the sentences differs to a cirtai'n 
e^ftent according to the background of the informants , 1e. the infonnatloMal 
value of the §oCd sentence is obviously quite small for a goldsmith , for 
example. Therefore the central deternilner of the informational valuci Is 
to be found -r the earlier experience of the world and in its influence 
on the individuals concerned. 

jfiQ rgferentla^ appropriateness, of the tfjst sentences affects the 
perception process. It is important for a language ^je^rner to master the 
sense-relatlorv rules (1e, the relation between fonn and meaning), but It 
is also irriportant to be acquainted with the rtnerence=re1 ation rules 
(see eg. Corder 103 - 104). The hearer must alwa^^s be aware of 

how the sentence meaning relates to the surrounding world, and the concepts 
about the surrounding world are to a large extent culturally based (Corder 
1973: 104). 

An example of what Is not usually done In a culture 1s found In the 
latter member of the pair Tliey mc 'kiLnting doa:^ / uantuy 'f^ugij it was 
poorly identified because of it being unusual that dogs are hunted. The 
knowledge of foreign cultures may also ciuse reference difficulties. The 
pair ingioiid U i^Ofnooi go^ ItA chip^ / ^kipb gathered a lot of hhip^ 
identifications, partly because the pupili know that Enqland ii island 
and it Is known for Its great seafarers. 

The abstract - concrete dimension that the test sentences occupy 
Is in direct relation to the perceptual process. The concrete reference 
of a sentence is apt to increase the probability that it Is chosen in the 
identification test* in spite of the fact that the thinking process of 
the informants is no longer based on that of concrete operations (see eg* 
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Helnonen 1972: 78 = 31; Carroll 1970; 135 = 138). The importance of 
concretiness far school learning is emphasized for all age groups 
{see eg, POPS I 1970: 102; Koort 1973: 214 = 215; Ausubel and Robinson 
1969: 183 - 190) in such i way that teaching should advance fron concrete 
to abstract because concrete reference is iasier to grasp. For exainplf, 
in the pair HL^ bid / bed u tc^c kigli the second one is more concrete 
and nearer to the informants' sphere of experience and therefore it 
better identified. Some other examples in which the concreteness of 
the first member has gathered more identifications are the following- 
It a the bigQcht taa-pot / tQapoit in the iCciCd. Thcu ahkad Tom 
coma I catm doiou I pickad it up and it -tCaiUu toai coCd / goCd, It icai 
at cut iwiL^z ! a tomx hcma and Hlh <ace ^ iUth I'm net tha ^boim si 
taicu. The concreteness - abstractness is naturally not the only 
pragmatic factor that influences the identification of these pairs; 
for example, in cold / gctd the informational value of the sentences 
plays a role. In general we cannot dividel the concrete - abstract 
dimension into absolute categories. Instead we are concerned with degrees 
of abstractness or concreteness. 

The semantic content of a sentence can also vary according to the 
truth dimension. Some sentences express general truths, ie. something 
that Is always true, whereas some others refer to occasional truths* 
ie. a truth that holds only under certain limited conditions. For example, 
in the pair Tho. bajh hptnd I hpmt tcr much mawu jf t^eefcejtrii the first 
member has to do with the genera 1 truth that the boys spend a lot of 
money at certain intervals but the second member only refers to a past 
period. 

The expectancies and presuppositions about the coming message often 
influence the identification and possibly also the discrimination process, 
^S Bruce (l?e8: 131) puts it, we hear what we expect to hear. The frame- 
work for this kind of phenomenon is provided by the redundancy found in 
speech. We do not hear the code in its entirety but sample it and make 
predictions on the basis of our sampling (see eg. Sutherland 1966: igi), 
Corder (1973: IZO) calls this a heuristic model. According to Curney 
(1973: 96), only 60-80 % of any communication is heard. Therefore know- 
ledge of the context and of the syntactic regularities makes completion 
and comprehension of a speech sequence possible, At^ the first stage of 
language studies in particular the unprcJictability of utterances is 
a source of difficulties. Therefore the learner must attempt to use the 
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sentince structure and the general setting to get a fair idea of what 
1s to ccne (see eg, Curnty 1973^ 9B\ Lehtonen* Sajavaara and May 1977: 
18). Quite often the expectancy errors are due to the extreme fanilia- 
flty of the key words or of tne whole test sentences; the Infortnants 
identify only the most familiar alternatives, excluding the others. For 
eximple, in the pair It tcai juiC a h^'aCC cut f cct, yea ice the third 
alternative If '^w jiUt a iJriaiC cat^ y^Ki ice was frequently chosen because 
of Its familiarity, 1e* the InfonTiants think they hear it despite 1t^ 
clear linguistic difference from the other alternatives* Other pairs v/hose 
first me*^bers are very familiar and which have been extended to the scope 
of the other two alternatives are He liab gi't a bike / pike. Have eeti 
th^ cL'irw ' CL^t^at mij mv: hhixVi ^ and I L^lt m>j hcaif / luit u: 
PiAncitcc . The low discrimination percentages Of the pairs mentioned 
above (mean solution I 43) may be an Indication of the anticipatory 
effect i ie. tht informants tend to 'hear' only the most familiar alter- 
native. Sometimes a centre of Interest may lead to an expectancy effect; 
in the pair He U inters ^ ted in ipat^ ' ^pct^ the %pi.'\ti alternative 
was popular because of Its greater Interest for the informants. Generally 
speaking the expectancy effect Is* to some extent * present in the Identi- 
fication of all the test items but in some cases its influence is more 
marked. 

tns social content of the sentences is liable to direct the Identi- 
fication process. Certain alternatives may socially more acceptable 
than others. The extent and quality of the social explanations seem to be 
dependent on the informants' age, because the susceptibility to environ- 
mental influence is different in various age groups* 14-15-year-old 
children (like our informants) are especially subject to environmental 
Influence (Heinonen 1972: 170 - 171). They have a tendency to become 
members of peer groupi and to accept itb roles and norms. Thus the social 
values of the peer group may conflict with the values set up by the 
surrounding society. For examplei the high identification percentages 
of the first members of the pairs tC/igtc ci ljqllx QL%hi / c£aii, ijcui 
gin / chin ittong mough and the frequent choice of the third alternative 
She U tooting ioK hm ^ImxAtj in case of the pair Ska u looking ^c^ hcA 
dmwj / Jawj reflect the intention to break the taboo connected with 
the uie of alcohol in Finland, 
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The person-centred iniage given by the test sentences also 
influinces the identification pfQcess, ObviouFly a reference to 
people is somehow experienced as more affective than that to various 
impersonal objects. To some extent this phenonienon may be related 
to the stage of the informants' social development favourable effect 
of the person-centred images is seen in the greatly varying identifi- 
cation percentages between the members of the pairs She locking ic-i 
hit-i ckeAxy / Jeity and He. Talkgd aboiLt /lU diain / Jmic. The occurrence 
of the proper names quite obviously fortifies the person-centred Image, 
The high identification percentage of Tad (88 ^) in the pair The man 
U Ted / dead seems to belong to the same category, 

A great number of discrimination and identification problems are 
related to various idiolectal features in the test sample. As Lehtonens 
S^Javaara and May (1977- 24) put it* "many elements found in the 'ideal' 
pHr- -logical representation of the message are simplified or deletedi 
Qf.u it is this simplified phonetic chain of utterance that language 
learners are faced with in a normal communicative situation. Though 
the testing situation of this study does not correspond to a natural 
coninunicati ve situation and therefore various pragmatic features like 
gestures and facial expressions are not available, some other idiolectal 
features ensure that the test sample is not an ideal representation of 
speech as described in textbooks. Factors such as speaking rate* tempo, 
rhythm, loudnesSi linkings assimilation and reduction make the lingutstic 
cues of each particular opposition prominent or less prominent (see eg* 
Lehtonen, Sajavaara and May 1977i 24; Gimson 1966: 3-4), The juncture 
cues in particular are subject to idiolectal variation to such an extent 
that they may even become neutralized. 

The most overt and frequent idiolectal factor in this study is the 
placement of the sentence stress on the key words* ie, the words containing 
the main linguistic identification and discrimination cues, (No predetermined 
stress pattern was provided (except in the case of sentence stress oppo- 
sitions) and no situational reference was presented to the speaker of 
the test material.) When the sentence stress happened to fall on the 
key words, the perceptual process was facilitated whereas the lack of 
the sentence stress on the 'critical' words had a contrary effect. The 
mean discrimination and identification percentages of the pairs that lacked 
the 'criticar sentence stress (ie, mt-cot, hiLt-kmnx, aoma'CCilm, pmah- 
bmchf &p2nt-4ipmd, cold-gold, tfLif3&-d/Up6 and watj to^ifJAltm] are 51,5 % 
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and 4US whereas the respective figures for the entire tests are 64 = 4% 
and 51.5 Thus both dlscriminatfon and idintification problenis seem to 
be related to the presence or absence of sentence stress on the key words. 
The lack of sentence stress also causes various reduction phenomena* which 
naturally weaken the linguistic cues in question. Another factor that 
contributes to the picking up of linguistic cues is the overall "wb- 
fulness" of the pronunciation in the test samples. For example^ in the 
pair piuicfiji - pwicn 5 the difference between the nembers v/as emphaiized 
and nearly all the Inforniants (06 %) perceived this distinction. No such 
effect was present in the pair tvj — dxy and the perception of the difference 
was very difficult (15 %). 

The effect of Swedish on discrimination and identification is not very 
extensive In this study but it is important as a potential cue, because the 
Informants had studied Swedish as their second foreign language for three 
years. Firstlyj Swedish Influence is apparent in the perception of the 
opposition /// - /tj/. In Swedish the corresponding sounds [/] and [tj] 
are in free variation (see eg- Stubelius 1069: 90). Secondly, in the 
identification test the word Ictk was confused with the Swedish word Hockf 
which means 'shutter'. The identification has taken place on the basis of 
the formal correspondence between these words * and therefore the hypothesis 
that Finnish speakers use the spelling as a point of reference gets more 
Support. 

The pragmatic cues can be divided Into various categories, for eKamplei 
syntactic, semantic* orthographical * social* psychological* etc.s but the 
common denominator for all these factors is the degree of familiarity of 
the different structures present in the test sentences. The familiarity 
may be related to words* sentences, orthography, meaning or social context 
of the test items. It is difficult to estimate the relative importance 
of the linguistic and pragmatic cues because their Importance varies from 
context to context. The influence of the linguistic cues is strongly 
related to the idiolect of the test sample. Even though the pragmatic 
cues in one fortn or another are nearly always present, their effect is 
emphasized If the linguistic cues are weak, which does not, of course, mean 
that the test sample is badly pronounced. As a whole the pragmatic cues 
reported In this Study are only the tip of the "iceberg" that is found 
tn a natural speech situation. 
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Discrimination and Identification tests gnd pronunciation skills* - 
This section will deal with the appl icab il , ? / of the perceptual tests 
used in this study to the measuring of the pronunciation skills. 

The comparison of the discrimination and identification skills 
is here carried out by means of the correlation matrix presented in 
Figure S. The discrimination and identificavior test results have 
been correlated with each other and with certain background variables. 
Among these background variables the mean of the marks in nl; the schunl 
subjects indicates the general success at school. The school marks in 
English show the English skills. The mark In mu^ic has been included, 
because an aptitude for music may, to som^ e)<trnti be connicted with 
the discrimination and identification skills. The production variable 
P) consists of the results of the productiOfi test. 
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Figure 5, The intercorrelations computed between the tests uied in this 
study and certain background variables. The variables are 
as followsi E - marks in English, H - marks in music, MN - mean 
marks in all school subjects, D - scores in the discrimination 
tests I - scores In the identification test, V - scores in the 
vocabulary test and P = scores in the production test. 



The correlations are not high (all less than 0,80), This is partly 
due to the fact that the tests were primarily designed for the study of 
the discrimination and identification of certain phonological oppositions, 
while the measuring of the pronunciation skills was only ^. secondary aim. 
For Instance no pretesting was carried out. Another factor lov/ering the 
correlation coefficients is the quite small scatter of the scores within 
both perceptual tests i the discrimination test scores ranging from 30 to 
40 (maximum 60) and the identification scores from 52 to 97 vn'th one 
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expectton (maximum 120), The corrisponding figures for the vocabulary 
test are 87 to 127 (maximum 131) (sei Appendix 2). The small variations 
tend to lower the correlation coefficients (lee eg. Peltonen 1970: 44; 
Koponin 1976.' 28), 

The corrilatlon between the identification and discrimination test 
Is quite low, 0.29i which shows that there is no great correspondence 
between them. Obviously they measure different skills. From a psycho- 
linguistic point of view the situation can be illustrated by Corder's 
model Of the receptive skills (1973i 117 - 124), which is presented in 
the form of a schema in Figure 6, 

UNDERSTANDING 

IDENTIFICATION 

f 

RECOGNITION 

t 

DISCRIMINATION 

Figure 6. Receptive activitiei from a psychol inguistic point of view 
according to Corder (1973: 117 - 124), 

All the skills (discrimination, recognition, identification and under- 
standing) presented in Figure 8 are necessary for the understanding of an 
FL message. The skills are hierarchicany ordered in such a way that 
identification activity, for examplei presupposes recognition and discrimi- 
nation ski 1 Is. 

The discrimination test used in this Study can be seen as an equivalent 
to the discrimination activity in Figure 8. It presupposes auditory skills 
only* 1e, the listener must be able to 'hear' certain sound phenomena (like 
the degrees of intensity, pitch, duration and quality) as distinct from each 
other. The discrimination process is therefore a passive activity because 
'hearing is not doing anything - it is Just something that happens to you' 
(Corder 1973: 117). Naturally the making of a distinction between various 
sound phenomena presupposes some concentration but no cognitive skills are 
neededj The discriminatory skills are not specifically linguistic, because 
they are needed in making sense of the 'world of sounds' in general. 

The identification test in this study contains all the activities 
mentioned in Corder's hierirchy. The starting point is the discriminatory 
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skill. For addition j the listener must possess the ricognitioh ski 111, 
le, he must ba able to put the incoming data into the schema that he 
has acquired by means of intuition and experience of language in general. 
The listener has internalized certain hypotheses against which he tests 
the incoming perceptions and locates the sensations he receives in the 
abstract model called 'perceptual schemata' (see eg. Corder 1973: 118). 
For examplei the recognition of an FL segmental phoneme means that we 
are able to classify it, ie. to match the input data with stored represen- 
tations. Therefore recognition is an active cognitive process which 
has at least something to do with problem solving ability (see eg. 
Gregory 1970: 30), 

At the word and word-group level the incoming data can be compared 
with the stored schemata but at the sentence level this is not possiblej 
because all the sentences hear have no equivalents in the models we have 
stored earlier. Instead we have stored a limited number of rules govern- 
ing various sentence structures. Every sentence is analysed at the 
identification phase. According to Corder (1973: 120), this is done 
heuristi cal ly 4 ie. by predicting the structure of the utterance concerned 
by means of sampling* 

Final ly* the listener must be able to understand the message he 
receives in its particular context and in this way realize the communi- 
cative intent of the speaker, This skill, (not far from) pragmatics, 
is called understanding in Corder *s model. Because the identification 
test in this Study presupposes, not only the discrimination and recognition 
of phonological Structures* but also the identification and understanding 
of the tost sentences i it comprises roughly speaking all the skills in 
the hierarchy above and is* rontrary to the discrimination test, a strongly 
cognitive activity. This becomes evident in the way in which the 
discrimination and identifier Hon tests correlate to the other variables 
in this study (see Figure 7). 

Firstly, the identification test correlates better to the apparently 
cognitive English skills (0.69) than does the discrimination test (0.21). 
The situation is the same with aptitude for music (0.39 for identification 
and 0.15 for discrimination). In the light of Corder 's model aptitude for 
music presupposes at least some kind of recognition activity. General 
school achievement correlates better with the identification test (0.66) 
than with the discrimination test (0.14). The vocabulary test that 
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presupposes somi cognitive mastering of English does not correlate with the 
discrimination test (0.02} at all but to some extent with the identification 
test (0.42). Thus the mastering of the vocabulary of the structures tested 
is not of great importance in the discrimination process i which supports 
the view that the d1scr1rnination test measures hearing ability^ even though 
the detailed study of the test items showed that the familiarity of the key 
words made it difficult to perceive certain distinctions. Some informants 
may have used the 'probable' meaning of the words and sentences as their 
perceptual strategy. In addition, the production test correlates better 
with the identification test (0.70) than with the discrimination test (0.26), 
The productive test is cognitive in the sense that the informants must 
know the relationship between English orthography and its phonological and 
phonetic representation. 

The above results are supported by the findings of Moisio and Valento 
(1976: 82). Because the discrimination test measured hearing ability only* 
such variables as language skills and attitudes towards English did not 
correlate as strongly with the discrimination test as with the analogy 
tests. Further* Moisio and Valento (1976: 8i) found that learners of 
German got as equally high scores in the discrimination test as learners of 
English and that the background variables which were not concarned with 
language skills proper correlated better with the discrimination test 
than with language-skill variables. In Hirvonen's study (1971: 34)* 
too* the correlations between the discrimination and the identification 
tests were quite low. Lehtonen and Koponen (1977: 84) report a correlation 
of 0.52 between the mark in English and the linking of words. 

That the discrimination test is relatively easier (solving % 64.4) 
than the identification test (solving % 51.1) probably reflects the 
mechanical nature of the discrimination test and the cognitiveness of 
the identification test (see Appendix 2), The same tendency was apparent 
in Moisio and Valento*s (1976: B6) study* where the solving percentage 
for the discrimination test was 74.3 % and for the analogy tests 53.4 
and 57.3. 

The results show that the identification test presupposes more 
complicated cognitive processes than the discrimination test* which 
could perhaps be replaced by a test measuring the discrimination of 
various sounds and noises which have nothing to do with linguistic 
distinctions. Thus the identification test clearly measures language 
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skills the maitiring of the FL)* whereas the discrimination test 
measures hearing ability only. According to Lghtonen (1972a- 21), 
a speaker with normal hearing can, at least after some e^cercisej 
discriminate between various FL sounds In the same way as he discriminates 
between various colours. But in FL learning this Is not inough. The 
learner must also be able to Identify the FL sounds and to realize that 
certain new phonetic cues may completely change the meaning of the 
utterance In question. In other words, the discrimination test does 
not necessarily measure the mastering of the foreign language sound 
system (see eg. Lehtonen 1972b^ 18). 

The scope of the tests can also be looked at from a linguistic point 
of view, 1e. to find out what linguistic levels (phonologyi syntax, 
semantics) they have to do with. The discrimination test is related to 
the phonological level only, but It does not even cover the phonology 
as a whole. The identification test consists of phonological * syntactic 
and semantic component. This is probably one reason for the claim that 
a test based on minimal pair sentences measures syntactic and semantic 
skills only (see eg, Tommola 197S: 13). However, the mastering of the 
phonological system Is Integrally related to the mastering of the skills 
presupposed by the other linguistic levels. For example , the linking 
of words and the introduction of pauses in sentences presuppose the 
mastering of syntax as well as the understanding of the functions of 
Junctures and sentence stress. The grouping of the sounds into phonemes 
is based on semantic principles (see eg. Lehtonen 1972a: SI 4 MacNey 
1971: BO). Further j the mastering of vocabulary may make the identifi- 
cation of individual phonemes easier in the same way as the understanding 
of phonological rules does. However, the identification result is not 
fully predictable by means of the mastering of the key words because 
the correlation between the Identification test and the vocabulary test 
is only 0.42. Some other factors like the redundancy with its expectancy 
function, soda! preference, informative value, or familiarity interfere 
with the mastering of the sound system. Therefore the linguistic levels 
should also include pragmatics (Lehtonen^ Sajavaara and May 1977: 180). 
The mastering of the sound system is not a mere sum of the phonological " 
distinctions and identification cues. Its connections with the skills at 
the other levels of language are quite obvious. Thus, for exarnple, the 
importanci of the individual phonemes in sentences or In natural speech is 
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rilatlvely small * becaus^. thiy can be predicted by means of cuss other 
than specifically phonological (phonetic) ones. 

The Identification test centalni a certain number of skills related 
to communicative competence though It is not concerned with an Interaction 
process proper between the speaker and the hearer ^ while the discrimination 
test has no strong connections with comniunicative £un^pU3tence. The discrimi- 
nation test procedure can be described by means of Corder'i (l«?73i 359) model 
presented in Figure 7. 



Input: Test materials 
1 . Linguistic data 



2. Test 1nst.*^uctions ( 'prograniTie ' ) 



Language 
prncessing 
devi ce 



Output 



Test responses 



Figure 7. The discrimination t^st procedure. 

According to this model the ifiput comi^x^ af tha linguistic data and 
the test instructions that are handled by tN U-ngjage processing device 
of the learner and the result output) is the test responses. The data 
in the discrimination Ik'S/ passes through the language fjrocessing device 
without being influenced by other factors. The situatign is diffirent 
in the identification test* which can be Illustrated by Corder's (1973: 
363) model of a test measuring cogmunicatlve competence (Figure S). 



Input 



Output 



Linguistic data 
Situatiprtdl 'Jata 



Test instructions 



General cognitive 
capacities etc. 



Language processing device 
formation rulPS= knowledge 
of culture and societyi 
"speaking rules" 



Test responses 



Figurfi (5, The identification test proceduri. 

Th^ input data consists of the linguistic data (- the phonological 
oppositions) and the situational data (^ in this case* various pragmatic 
factors and the test instructions). The language processing device 
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contains the liarner's linguistic compitince (= fOfmation rulis) and 
pirformance (= speaking rules) together vn'th the Influence of the 
general cognitlvi capacities. The result of the processing of the 
input data Is the test responses. Thus the identification test has 
to do with cognitive skills and with conmunicative competence (although 
the pragmatic cues cover only some of those present in natural speech). 
It also has links with knowledge of society and culture in general. The 
discrimination test* however, lacks these components. 

The relationship between the discrimination and identification 
tests and the productive skills is an Interesting problem. There is 
a consensus of opinion en the fact that the receptive and productive 
processes are related to each other (see eg. Carroll 1970: M\ Wangler 
and Weiss 1975: 196; Corder 1973: IZi). For example, the auditory 
feedback is needed in the production process for the control of the 
articulatory movements. However* it is obvious that the receptive and 
productive processes do not completely correspond to each other. According 
to Lehtonen, Sajavaara and May (1977: 18 - 19), a complete correspondence 
would mean that the "interpretation of the message takes place after the 
hearer has received the acoustic cues contained in the surface form of 
the message". The native speakers however, uses his ability to predicts 
and the perception takes place by means of all the available linguistic 
and non-linguistic cues. Therefore, neither the receptive process nor 
the productive one are based en the 'Ideal' surface representation 
of speech! many intervening factors complicate these processes (see. 
eg. Corder 1973: 123). 

The results of this study show that the connection between the 
receptive and productive skills depends on what kind of processes these 
skills presuppose. The Identification test measuring integrative and 
cognitive skills correlates better with the production test (0.70) than 
with the discrimination test primarily measuring mechanical auditory 
skills (0.25). Thus the ability to discriminate does not necessarily 
imply any productive skills (see also van Teslaar 1965^ 88; Tommola 
1975: 6). The strong correlation between the identification and pro- 
duction tests may be due to the fact that both tests presuppose some 
cognitive and integrative skills. An indication of the cognitive and 
integrative nature of the productive skills is the high correlation 
(0,65) between the identification test and the English marks. Further, 
the correlationi between the identification test and Swedish marks (0.77) » 
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music (0*43) and gineral school achitvement (0.77) are quiti high. The 
high eorrelationi between the productive and receptive ^^kills obtained 
by Muniill (I973i 46, 54, 103) and Kjenmer (1968i 138 - 139), 0.78 and 
0.58 respectively, are obviously explainable by the same kind of reasoning 
as lies behind the high correlitions of this study. 

The low correlations between the discrimination test and the production 
skills are probably due to the fact that no parallel exists between reception 
and production in regard to cognitive pipcesses. Thus if we need a receptive 
test capable of predicting the production skills through which the message in 
the orthographical fonn will be translated iHto speechi we must construct 
a test similar to the identification test used In this studyi it must consist 
of the varipus cues necessary in the interpretation of the message (cues 
related to phonology i morphology i syntax, semantics and pragmatics) (see eg. 
Kohonen 1976^ 39^ Tommola 1977: 25 - 26^ Pilliner 1977i 14; Kohonen and 
Saleva 1978: 18), It is impossible to say if reception is a precondition 
for production, or the Other way rounds or whether the relationship between 
these skills is interactive* However, an awareness of all the factors 
Influencing the understanding and production of a message is necessary for 
an FL learner. 

The correlation between aptitude for music and the other variables 
varies a lot; between it and the rnark in English (0.27)* general school 
achievement (0.2?) and the discrimination test (0.15) the correlations 
are quits low* Between it and the identification test (0.39) production 
test (0.44) and vocabulary test they are however somewhat higher (O.iZ). 
Lehtonen and Koponen (1977: 84) got a correlation of 0,48 between aptitude 
for music and the preference for linking. In Lehtovaara's (1974: 50) 
study i aptitude for music was better correlated with the linking of sounds 
(0.32) and intonation (0.35) than with discrimination (0,06). So it 
seems that aptitude for music is not very strongly related to discrimination 
but somewhat better to the variables having to do with skills of a more 
cognitive native. On the other hand, general cognitive capacities like 
school achievement do not get very high correlations. Therefore we can 
assume that aptitude for music is related to the same area of the 
comnunicati ve process as the phonetic identification cues* ie. the use 
of the phonetic cues has SOTiething to do with aptitude for music. 
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CONCLUSIONS 

Thi great numbif of factors Influencing the receptive pronunciation 
skills refttcts the complex nature of these skills. We are mt only 
concerned with interpreting tfie phonetic surface structure of an FL 
utterance but we are supposed to have an abilitj^ to cope with all the 
factors present in a partiiEUlar rtception situation. 

It is* nowewer, Impossible to say which kinds of factors are more 
important, phonetic or pragmatic.: We may only assume that the use of 
the phonetic identif Icatiion cues is greatest at the beginning of FL 
learning and that their importance decreases when the studies advance * 
while the Importance of pragmatic cues increases. It may also be that 
when the speech situation becomes more natural, the use of pragmatic 
cues increasei. Thereforei tests that measure the mastering of phonetic 
cues are not comprehensive enough, even though the phonetic factors 
as such may constitute an important source of receptive difficulties. 

The use of the ppagmatic cues in the perceptual process presupposes 
the learner's ability to operate on both the syntactic and the morpho- 
logica] level; operations with the phonetic surface structure should be 
so far automatized that the learner need not consciously concentrate 
on the picking up of the information it carries. The reception should 
be based on sampling and on the use of the redundancy present in speech. 

From the language teaching point of view what is essential Is how 
these receptive skills can be developed. All the factors playing a role 
in the receptive process should be taken into account. The materials 
should consist of many varieties of the FL* so that the interpretation of 
the phonetic cues would become unconscious and automatic. The inter- 
pretation of the message should be based on morphological ^ syntactlCi 
semantic and pragmatic cues. Moreover, the cognitive and communicative 
nature of language learning should be emphasized. The lack of attention 
to cognitive skills can be seen, among other thingSi in the introduction 
of the direct and audiolingual methods; for e^cample. In a pilot study 
by the present writer the number of cognitive techniques used in teaching 
pronunciation and semantics amounted to only 28 % of all the methods used. 

At present, the teaching of English pronunciation cannot be satis- 
factorily Carried out in Finnish comprehensive schools. During the first 
three y|arSs when the teaching of pronunciation could be appropriate 
because of the pupils' good ability in sound perception and imitations 
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tht groupi to he taught arm too big or, especially in the country, are 
compoied of two groups at different lavcli (eg. the fifth and siMth 
forms) are taught f imultaneoufly. If the time available for efficient 
teaching is about 20 mlnutiS twice a week some grammar and vocabulary 
should also be dealt with, the teaching of pronunciation remains minima" 
At the beginning of language studies, listening exercises would be 
ifflportanti but the pupils cannot concentrati on mere listening* because 
they want to express themselves in the foreign language. Thus, althougl 
the language learning process would ^i promoted by listening exercises, 
various pedagogic reasons do not always favour this. 

At the later stages of English studies the problems are related to 
big teaching groups , to the small number of English lessons and to the 
lack of teaching aids. 
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APPENDIX K The material of the pfesent itudy. 

1 . I fee! it now. 
I fill it now. 

Z. The heel is very high. 
The hill is very high. 

3. He wanted to leave. 
Hi wanted to 11 ve. 

4. I saw the sheep there. 
I saw the ship there. 

5. This is your wills Isn't it? 
This is your wellp Isn't It? 

6. The rubbir Is on the disc, you see. 
The rubber is on the deskj you see. 

7. My pins are not here. 
My pens are not here. 

8. His bid is too high. 
His bed is too high. 

9. It is the biggest tea-pot in the world. 
It is the biggest tea^port in the world. 

10. He is interested In spots. 
He Is Interested In sports. 

11. Have you seen the fox there? 
Have you seen the forks there? 

12. It was just a small cot, you see. 
It was just a small cut, you see. 

13. Have you seen the collar of my new shirt? 
Have you seen the colour o^ my new shirt? 

14. His lock is very good. 
His luck Is very good. 

15. The bun 1i too big. 
The barn Is too big. 

16. I left my hut in San Francisco. 

i left my heart In San Francisco. 

17. We asked Tom to come down. 
We asked Tom to calm down. 

18. The zoo's luck was a popular talking point. 
The zoo's lark was a popular talking point* 

19. Look at those peas in your garden! 
Leek at those bees In your garden' 

20. John liked the peach very much. 
John liked the beach very much. 

21. He has got a pike. 
He has got a bike. 



- 222 - 



2E, Tha man 1a Tad. 
Tha tnin la dtad, 

23. Hi ufid to wrfte wftin he was young. 
He used to rtdi when fie was young ^ 

24. They built the housi vary well* 
They build the houie very well. 

25. The boys spent too much money at weikendi. 
The boys spend too much money at weekends. 

26. You've got writings on your baek, haven't you? 
You've got writings on your bags haven't you? 

27. I picked It up and it really was cold* 
I picked it up and 1t really was gold. 

28. His coat is very eld* 
His goat is very old. 

2i, Where is your class? 
Where Is your glass? 

30. She is looking for her cherry. 
She is looking for her Jerry. 

31* He talked about his chain* 
He talked about his Jane. 

32, is your chin strong enough? 
Is your gin strong enough? 

33. Let's try it. 
Let's dry it, 

34* John liked the trips very much, 
John liked the drips very much. 

35. When I was sitting in the boat I started to sink. 
When I was sitting in the boat I started to think. 

36. He has not got the same face as before. 
He has not got the same faith as before, 

37. He is washing the baby. 
He is watching the baby, 

38. England is famous ^ its ships. 
Ingland is famous its chips* 

39. Tom doesn't understand his friends at all, 
Tom doesn't understand his French at all. 

40* His tulips are red. 
His two lips are red. 

41* I saw the meat* 
I saw than eat. 

42, They have got peace talks there. 
They have got pea stalks there* 

43, That princess is our friend* 
That prince S is our friend. 




44. I know the way to cut it, 
I know the waiter cut it, 

45. Her dol] has got a grey tie. 
Her doll has got a great eye. 

46. It was at our house. 
It was a tower house. 

47. This nitrate is expensive nowadays. 
This night-rate is expensive nowada 

48. That's our milk. 
That*s sour milk. 

49. It 1s a name. 
It is an aim, 

50. Thiy free Danny. 
They freed Annie, 

51. I've got twenty-six ones. 
I've got twenty sick swans, 

52. My newer dress is here. 
My new address Is here, 

53. Have you seen the 'White House? 
Have you seen the white 'house? 

54. 'Fiymg aeroplanes can be dangerous 
Flying 'aeroplanes can be dangerDus 

55. John gave her dog 'blscuit^^ 
John gave her 'dog biscuit = 

56. They are 'hunting dogs. 
They are hunting 'dogs, 

57. The 'book-case was a special one. 
The book 'case was a special one. 

58. 'Visiting friends can ve dangerous. 
Visiting 'friends can be dangerous, 

59. They are 'playing cards. 
They are playing 'cards, 

60. They are * frying chickens. 
They are frying 'chickens. 
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APPENDIX 2, The acores achieved by the informants, in the discrimination » 
identification, vocabulary and production tests together with 
the iubjects' background information. The iymbols used In 
the table are ai fallows: D stands for the discrfmlnation, 
I for the identification, V for the vocabulary and P for 
the production test. MN, E and M give the informants' mean 
for all the school subjects, their mark in English and 
their mark in music. 



Infortnant no Tests School marks 





D 
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V 


P 


MN 


E 




M 
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39 




54 


109 
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5.7 


5 




8 
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37 




60 


89 


4 


5.5 


5 




B 




3 


41 




69 


115 


5 


6,7 


9 




8 
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41 




59 


101 


7 


7.6 


8 




5 






43 




69 


106 


12 


7.6 


7 




8 
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35 




70 


106 
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7.8 


8 




8 
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43 




63 


in 


10 


7.7 


8 




9 
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36 




62 


991 
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8.1 


7 




5 




9 


44 




97 


127 


14 


8,8 


9 




9 




10 


31 




67 


111 
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7.3 


8 




9 




11 


46 




72 


108 
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8.5 


9 




9 




12 


33 




69 


119 


10 


8.3 


7 




8 




13 


35 




67 


109 


14 


8.Z 


8 




9 




14 


40 




74 


in 
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8.7 


9 




9 
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33 




47 


377 
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6,7 
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15 


39 




75 


115 
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8.3 


8 




8 
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38 




62 


67 
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5.7 


6 
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35 




52 


95 
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6.5 


5 




8 
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49 




58 


64 
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7,1 


7 




7 
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45 
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115 
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8 




8 
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73 


103 
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7.8 


8 
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38 




53 


105 
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7,9 


9 




7 




23 


47 




60 


107 
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7,3 


7 




8 




24 


35 




60 


98 
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7.0 


6 




8 




25 


36 




37 


106 


3 


6.6 


5 




7 




Maximum 


60 




120 


131 


15 


10 


10 




10 




Mean score 


38, 


.3 


64.1 


103.0 


8.6 


7.47 


7, 


,28 


7, 


/2 


Stdev 


5. 


.02 


11.51 


14.59 


3.19 


0.87 


1 . 


,37 


1 . 


14 
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APPENDIX 3. The rgsults of the discr iniination, identification and vocabulary tests. The letters IT stand for 
the Item tested. D stands for the discrimination test results divided into' ON, the number of the 
item in question on the answer sheet; DFCi the frequency of correct ansvvers; and DPCj the percentage 
of correct answers. I stands for the identification test results, the subgroups being IN the 
number of the item on the answer sheet), IPC the frequency of correct answers)-, IFO the 
frequency of answers gathered by the opposite member) * IFT the frequency of answers gathered by 
distractor alternative}* IPC (- the percentage of correct answers), IPO (= the percentage of the 
answers gathered by the opposite member), IPT (= the percentage of ansv/ers got by the distractor 
aHernative)* The letter V stands for the vocabulary test risults and is divided into VFC the 
frequency of correct answers in vocabulary test), VPC (= the percentage of the corrict answers). 
The letters PT stand for the mean pircentage of the correct answers in all the tests (ie. the 
discrimination + identification + vocabulary test) in the case of each item. 
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fill 
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20 


80 
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8 


14 
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32 


56 


12 


18 


72 
96 


61.33 


feel 
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20 


80 


24 


18 
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72 


24 
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24 


82.67 


hill 
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16 


64 


34 


19 
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4 


76 
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16 


23 


92 


77.33 


heel 
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16 


64 


13 


6 


ID 
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24 


76 
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13 


52 


46.67 


11 ve 


11 


11 


88 


18 


16 
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64 


20 


16 


23 


92 


81.33 


leave 


11 


22 


88 


21 


13 


12 
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52 


48 


0 


21 


84 


74.67 
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ERIC 



DN DFC DPC IN IFC IFO I FT IPC IPO 



IPT 



VFC 



VPC 



Shi p 
sheep 

win 

wel 1 

dlic 
desk 

pins 
pens 

bid 
bed 

tea-pQt 
tea^port 

spots 
sports 

fox 
forks 
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cot 
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40 
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35 
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40 
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40 
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35 
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48.00 


collar 
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28 


4 


21 


0 


16 


84 
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7 


28 


28.00 


colour 
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40 


31 


22 


2 


1 


88 
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24 


96 


74.67 


lock 




18 


24 


96 


36 


13 


g 




52 


36 


12 


16 


54 


70,67 


luck 




18 


24 


96 


32 


23 


2 


0 


92 


8 


0 


22 


88 


92,00 


bun 




1 


11 


88 


1 


5 


2 


la 


20 


8 


72 


19 


76 


61.33 


barn 




1 


22 


88 


35 


2 


q 


14 


8 


36 


56 


8 


32 


42,67 


hut 




13 


12 


48 


23 


1 


19 


5 


4 


76 


20 


4 


16 


22.67 


heart 




13 


12 


48 


20 


22 


0 


3 


88 


0 


12 


21 


84 


73,33 


come 




14 


17 


68 


25 


23 


0 


2 


92 


0 


8 


120 


100 


86.67 


cairn 




14 


17 


68 


27 


2 


21 


2 


8 


84 


8 


6 


24 


33.33 


luck 




15 


13 


52 


26 


10 


3 


12 


40 


12 


48 


22 


88 


60.00 


lark 




15 


13 


52 


30 


1 


110 


14 


4 


40 


56 


21 


84 


46,67 


peas 




41 


11 


44 


79 


17 


3 


5 


68 


12 


20 


17 


68 


60.00 


bees 




41 


n 


44 


S3 


13 


11 


1 


52 


44 


4 


8 


32 


42,57 


peach 




46 


14 


56 


90 


11 


11 


3 


44 


44 


12 


21 


84 


61.33 


beach 




46 


14 


56 


85 


15 


IB 




20 


72 


8 


18 


72 


49,33 


pi ke 




54 


11 


44 


96 


2 


23 


0 


8 


92 


0 


9 


35 


29.33 


bike 




54 


11 


44 


100 


23 


1 


1 


92 


4 


4 


25 


100 


78,67 


Ted 




43 


21 


84 


116 


22 


3 


0 


08 


12 


0 






86.00 


dead 




43 


21 


84 


81 


22 


2 


1 


88 


8 


4 


23 


92 


88.00 


write 




4a 


13 


52 


80 


12 


12 


1 


48 


48 


4 


26 


100 


66.67 


ride 
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13 


52 


115 


14 


9 


2 


56 


36 


8 


25 


100 


69.33 
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DN DFC 
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IFT IPC 
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bui It 
build 


45 
45 


20 
20 


80 
80 


117 

85 


15 

5 


8 
13 


Z 
7 


60 
20 


32 
52 


o 

28 


22 
22 


83 
88 


76.00 
62,67 


Spent 
spend 


47 
47 


10 

10 


40 
40 


119 

87 


8 
13 


15 
11 


2 
1 


32 
52 


60 
44 


8 
4 


23 
23 


92 
92 


54,67 
6K33 


back 
bag 


49 
49 


22 
22 


88 
88 


91 


8 
20 


15 
5 


2 
0 


32 
80 


60 
20 


8 
0 


23 
25 


92 
100 


70.67 
89,33 


cold 
gold 


58 
58 


19 
19 


76 
76 


107 
103 


14 

15 


10 

6 


1 

0 


56 
76 


40 
24 


4 
0 


25 
25 


100 
100 


77,33 
84,00 


coat 

goat 


59 
59 


10 
10 


40 
40 


106 

log 


23 

a 


2 

16 


0 

1 


92 
32 


8 
64 


0 
4 


25 
17 


lOo 

68 


77,33 
46,67 


class 
glass 


60 
60 


20 
20 


80 

80 


112 
122 


10 
14 


15 

9 


0 

2 


40 

56 


60 
36 


0 

8 


13 

25 


52 
100 


57.33 
68,67 


cherry 
Jerry 


53 
53 


14 
14 


56 
56 


95 
104 


6 
21 


IS 
1 


4 
3 


24 
84 


60 
4 


16 
12 


20 


80 


53.33 
0,00 


chain 
Jane 


55 
55 


14 
14 


56 
56 


102 

98 


3 
23 


22 
0 


0 
2 


12 

92 


88 
0 


0 
8 


15 


SO 


2,67 
4.00 


chin 
gi n 


55 
56 


22 
22 


88 
88 


99 
105 


7 

is 


18 

4 


0 
3 


28 
72 


72 
16 


0 
12 


12 
19 


48 
76 
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